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NUCLEOTIDE SEQUENCES OF GALACTINOL SYNTHASE 
FROM ZUCCHINI AND SOYBEAN 
5 BACKGROUND OF THE INVENTION 

Raffinose saccharides are a group of D-galactose- 
containing oligosaccharide derivatives of sucrose that 
are widely distributed in plants. Raffinose saccharides 
are characterized by the general formula: [O-fi—D- 

10 galactopyranosyl- (1 ^ 6) n -a--glucopyranosyl- (1 -»2)-£-D- 
fructofuranoside where n=0 through n=4 are known 
respectively as sucrose, raffinose, stachyose, 
verbascose, and ajugose. 

Extensive botanical surveys of the occurrence of 

15 raffinose saccharides have been reported in the 

scientific literature [see Dey (1985) in Biochemistry of 
Storage Carbohydrates in Green Plants, P. M. Dey and 
R. A. Dixon, Eds. Academic Press, London, pp. 53-129]. 
Raffinose saccharides are thought to be second only to 

20 sucrose among the nonstructural carbohydrates with 
respect to abundance in the plant kingdom. In fact, 
raffinose saccharides may be ubiquitous, at least among 
higher plants. Raffinose saccharides accumulate in 
significant quantities in the edible portion of many 

25 economically-significant crop species. Examples include 
soybean (glycine max L. Merrill), sugar beet (JBsta 
vulgaris) , cotton (Gossypium hirsutum l . ) , canola 
(Brassica sp.) and all of the major edible leguminous 
crops including beans (Phaseolna S p.), chick pea 

30 arietinum) , cowpea (Vigna uncruiculat.al r mung bean ( Viana 
radiata) , peas (Eisiim sativum) / lentil (Len^ culinari si 
and lupine ( Lupimis sp . ) . 

Although abundant in many species, raffinose 
saccharides are an obstacle to the efficient utilization 
35 of some economically-important crop species . Raffinose 
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saccharides are not digested directly by animals, 
primarily because a-galactosidase- is not present in the 
intestinal mucosa [Gitzelmann et al. (1965) Pediatrics 
36:231-236; Rutloff et al. (1967) Nahrung 11:39-46]. 
5 However, microflora in the lower gut are readily able to 
ferment the raffinose saccharides resulting in an 
acidification of the gut and production of carbon 
dioxide, methane and hydrogen [Murphy et al. (1972) J. 
Agr. Food. Chem. 20: 813-817; Cristofaro et al. (1974) 

10 in. Sugars in Nutrition, H. L. Sipple and K. W. McNutt, 
Eds. Academic Press, New York, Chap. 20, 313-335; Reddy 
et al. (1980) J. Food Science 45:1161-1164]. The 
resulting flatulence can severely limit the use of 
leguminous plants in animal, particularly human, diets. 

15 It is unfortunate that the presence of raffinose 

saccharides restricts the use of legumes in human diets 
because many of these species are otherwise excellent 
sources of protein and soluble fiber. Varieties of 
edible beans free of raffinose saccharides would be more 

20 valuable for human diets and would more fully use the 
desirable nutritional qualities of edible leguminous 
plants . 

Soybean meal is the principal source of protein in 
animal feed, especially feed for monogastric animals 

25 such as poultry and swine. Approximately 28 million 

metric tons of soybean meal were produced in the U.S. in 
1988 [Oil Crops Situation and Outlook Report (Apr 1989) 
U.S. Dept. of Agriculture, Economic Research Service]. 
Soybean meal is produced by treating soybeans with 

30 hexane to remove the oil and then toasting the extracted 
material to remove the residual solvent. Although the 
soybean is an excellent source of vegetable protein, 
there are inefficiencies associated with its use that 
appear to be due to the presence of raffinose 

35 saccharides. Compared to maize, the other primary 
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ingredient in animal diets, gross energy utilization for 
soybean meal is low [see Potter et al. (1984) in 
Proceedings World Soybean Conference III, 218-224]. For 
example, although soybean meal contains approximately 6% 
5 more gross energy than ground yellow corn, it has about' 
40 to 50% less metabolizable energy when fed to 
chickens. This inefficiency of gross energy utilization 
does not appear to be due to problems in digestion of 
the protein fraction of the meal, but rather due to the 

10 poor digestion of the carbohydrate portion of the meal. 
It has been reported that removal of raffinose 
saccharides from soybean meal by ethanol extraction 
results in a large increase in the metabolizable energy 
for broiler chickens [Coon et al. (1988) Proceedings 

15 Soybean Utilization Alternatives, University of 
Minnesota, 203-211] . Removal of the raffinose 
saccharides was associated with increased utilization of 
the cellulosic and hemicellulosic fractions of the 
soybean meal. Soybean varieties free of raffinose 

20 saccharides could be used to produce meals that would 
have added value for individuals who either produce 
soybean meal for animal feed or use soybean meal as a 
major component in the diets for their animals. 

In addition to its use in animal diets, soybeans 

25 are used to produce enriched sources of vegetable 

protein for human use. Examples of soybeans in human 
foods include soy protein concentrate, textured soy 
protein and infant formula. Facilities and methods to 
produce protein isolates from soybeans are available 

30 across the U.S. One of the unsolved challenges faced by 
producers of soy protein isolates is selectively 
purifying the protein away from the raffinose 
saccharides. Considerable added costs result from 
removing the large amounts of raffinose saccharides that 

35 are present in soybeans. Again, soybean varieties free 
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of raffinose saccharides would reduce the cost of 
producing soy protein products as well as improve the 
nutritional quality of the end product. 

Other agronomically-important crops such as cotton 
5 and canola are also used as secondary sources of protein 
for animal diets. Meals produced from seeds of these 
species also contain raffinose saccharides. The effect 
of raffinose saccharides on the nutritional quality of 
cottonseed and canola meal has received little or no 

10 attention, but it is possible that raffinose saccharides 
are as great a barrier to the use of these meals as they 
appear to be with soybean. Both cotton and canola meal 
have less value than soybean meal for metabolizable 
energy to animals [Feedstuffs (1990) Reference Issue 

15 62:24-31]- Varieties of cotton or canola free of 

raffinose saccharides may have added value for use in 
animal feed. 

An additional problem associated with the presence 
of raffinose in plants occurs in the production of 

20 sucrose from sugar beets. The small amount of 

raffinose, ca. 0.05%, compared to sucrose, ca. 16%, in 
expressed beet juice is sufficient to decrease the 
efficiency of crystallization of sucrose. As a result, 
sugar manufacturers have resorted to the use of 

25 immobilized a-galactosidase to reduce the content of 

molasses during the refining of sugar beet juice [Linden 
(1982) Enzyme Microb. Technol. 4:130-136]. Sugar beet 
varieties free of raffinose would not need this 
additional processing and therefore would have added 

30 value to sugar beet processors. 

Although nutritional and economic problems are 
associated with the presence of raffinose saccharides in 
many crops, certain benefits are also ascribed to this 
family of oligosaccharides. Seed viability has been 

35 correlated positively with the presence of raffinose 
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[Ovacharov et al. (1974) Fiziol. Rast. 21:969-974; 
Caffrey et al. (1988) Plant Physiol . 86:754-758; 
Schleppi et al. (1989) Iowa Seed Science 11:9-12]. It 
is thought that raffinose helps maintain the integrity 
5 of the membranes of seeds as they undergo the 

desiccation process during maturation. Raffinose also 
may play an important role in the cryoprotection of 
plants. The accumulation of raffinose in plants exposed 
to cold temperature has been indicated in a number of 

10 species [Parker (1959) Bot. Gaz. 121:46-50; Alden et al. 
(1971) Bot. Rev. 37:37-142, see Kandler et al. (1982) in 
Encyclopedia of Plant Physiology, New Series, Vol. 
13A: 348-383; Mitcham-Butler et al. (1987) J. Amer. Soc. 
Hort. Sci. 112:672-676; Castillo et al. (1990) J. Agric. 

15 Food Chem. 38:351-355]. This accumulation of raffinose 
may be responsible for protection of chloroplasts and 
has been shown to be highly correlated with the 
postharvest retention of needles in horticulturally- 
important coniferous species such as Fraser fir [Ab±£S. 

20 fraser j (Pursh) Poir.] and white pine (EimiS. strobus 
L.). Retention of needles in these species affects 
their quality for use as ornamental plants, such as 
Christmas trees. Producing plants with increased 
amounts of raffinose saccharides may increase cold 

25 hardiness resulting in increased post-harvest quality, a 
greater ability to withstand cold temperatures or 
provide producers greater flexibility when harvesting 
trees . 

In spite of the problems associated with the 
30 presence of raffinose saccharides in soybean products 

for human use, for certain food applications increasing 
the amount of fermentable carbohydrates present in the 
soybean offers some advantages. [Economic Implications 
of Modified Soybean Traits (1990) Special Report ISSN: 
35 0361-199X Iowa State Report] . Examples include oriental 
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foods such, as tofu, tempeh, natto, and soy sauce. In 
these applications, soybean varieties with increased 
amounts of readily-fermentable sugars such as sucrose 
and the raffinose saccharides would have added value for 
5 the producers of these products. 

The biosynthesis of raffinose saccharides has been 
fairly well characterized [see Dey (1985) in 
Biochemistry of Storage Carbohydrates in Green Plants, 
P. M. Dey and R. A, Dixon, Eds. Academic Press, London, 

10 pp. 53-129]. The committed reaction of raffinose 
saccharide biosynthesis involves the synthesis of 
galactinol from UDP-galactose and myo-inositol . The 
enzyme that catalyzes this reaction is galactinol 
synthase. Synthesis of raffinose and higher homologues 

15 in the raffinose saccharide family from sucrose is 
thought to be catalyzed by distinct 

galactosyltransf erases (for example, raff inose synthase 
and stachyose synthase) . Studies with many species 
suggest that galactinol synthase is the key enzyme 

20 controlling the flux of reduced carbon into the 

biosynthesis of raffinose saccharides [Handley et al. 
(1983) J. Amer. Soc. Hort. Sci. 108:600-605; Saravitz, 
et al. (1987) Plant Physiol. 83:185-189]. Altering the 
activity of galactinol synthase, either as a result of 

25 overexpression or through antisense inhibition, would 

change the amount of raffinose saccharides produced in a 
given tissue. 

In order to alter the activity of galactinol 
synthase using molecular biological approaches it is 

30 essential to isolate the gene(s) or cDNA(s) encoding the 
enzyme. There are no published reports for the 
purification of homogeneous galactinol synthase which 
would allow one to prepare DNA probes based on amino 
acid sequence information. Applicants describe here the 

35 purification of galactinol synthase from zucchini 
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( Cucurbita pepo ) and the subsequent cloning and use of 
galactinol synthase-encoding nucleotide sequences from 
zucchini, soybean, and canola. 

SUMMARY OF THE INVENTION 
5 The invention can be more fully understood from the 

following detailed description, the accompanying 
drawings and the Sequence Descriptions which form a part 
of this application. The Sequence Descriptions contain 
the three letter codes for amino acids as defined in 37 

10 C.F.R. 1.822 which are incorporated by reference herein. 
The citation herein of any patents, pending U.S. 
applications, and any other disclosure that was 
available to the public as of the filing date of the 
instant application are incorporated herein by reference 

15 in their entirety. 

A means has been discovered to produce transgenic 
plants and microorganisms in which a galactinol synthase 
gene overexpresses or underexpresses yielding raffinose 
saccharides in altered (higher than normal or lower than 

20 normal) levels as compared to non-transgenic plants or 
microorganisms. Transgenic plants that produce higher 
than normal levels of raffinose saccharides possess 
enhanced cold tolerance and in coniferous species will 
result in reduced postharvest needle abscission. 

25 Transgenic plants that produce lower than normal levels 
of raffinose saccharides are more easily digestible and 
have available larger amounts of metabolizable energy. 
In sugar beets in particular, a decrease in raffinose 
saccharide content would improve sucrose crystallization 

30 and overcome the need for expensive processing of sugar 
beet extracts. 

One aspect of the invention is an isolated nucleic 
acid fragment comprising a nucleotide sequence encoding 
plant galactinol synthase. The fragment is derived from 

35 soybean, zucchini, canola, cotton, edible legumes, sugar 
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beet,, coniferous horticultural species, Stachyg r maize 
or tobacco. More prefereably, the fragment is derived 
from soybean, zucchini, or canola. 

Another aspect of the invention is an isolated 
5 nucleic acid fragment comprising a nucleotide sequuences 
corresponding to or substantially homologous to SEQ ID 
NO: 5 or SEQ ID No: 6 encoding plant galactinol 
synthase. 

Another aspect of the invention is a chimeric gene 

10 capable of being expressed in transformed plants that 

comprises an isolated nucelotide sequence encoding plant 
galactinol synthase and suitable regulatory sequences 
such that upon transformation said plant overexpresses 
raffinose saccharides relative to non- trans formed 

15 plants. A related aspect of the invention is a chimeric 
gene capable of being expressed in transformed plants 
that comprises an isolated nucelotide sequence encoding 
antisense RNA complementary to plant galactinol synthase 
and suitable regulatory sequences such that upon 

20 transformation said plant underexpresses raffinose 
saccharides relative to non-transformed plants. A 
further aspect of the invention is a chimeric gene 
capable of being expressed in transformed microorganisms 
comprising nuceic acid fragments encoding plant 

25 galactinol synthase and suitable requlatory sequences. 

A further aspect of the invention is a host cell 
transformed with a chimeric gene capable of being 
expressed in transformed plants, the gene comprising a 
nucleotide sequence encoding plant galactinol synthase 

30 and a suitable regulatory sequence. 

An additional aspect of the invention is a plant 
transformed with a chimeric gene capable of being 
expressed in transformed plants, the gene comprising a 
nucleotide sequence encoding plant galactinol synthase 

35 and a suitable regulatory sequence. Such transformed 
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plant overexpresses or underexpresses raffinose 
saccharides relative to non-transformed plants- Such 
transformed plant is a higher plant including soybean, 
zucchini, canola, cotton, edible legumes, sugar beet, 
5 coniferous horticultural species, Stachys , maize and 
tobacco. More preferably, the transformed plant is 
soybean, zucchini or canola. 

A further important aspect of the invention is seed 
obtained from plants transformed with a chimeric gene 

10 capable of being expressed in transformed plants, the 
gene comprising a nucleotide sequence encoding plant 
galactinol synthase and a suitable regulatory sequence. 
Such seeds are obtained from transformed plants 
including transformed soybean, zucchini, canola, cotton, 

15 edible legumes, sugar beet, coniferous horticultural 

species , Stachys > maize and tobacco. More preferably, 
the transformed plant is soybean, zucchini or canola. 

A further aspect of the invention is a 
microorganism transformed with a chimeric gene capable 

20 of being expressed in a microorganism, the gene 
comprising a nucleotide sequence encoding plant 
galactinol synthase and a suitable regulatory sequence. 
Most preferably, the transformed micoorganism is E , 
coli . 

25 Another aspect of the invention is a method for 

obtaining plants and plant cells containing altered 
(higher or lower than that of non-transformed plants) 
levels of raffinose saccharides and or sucrose . This 
method is performed by 

30 (a) transforming a plant cell with a chimeric 

gene that encodes antisense RNA complementary to 
galactinol synthase; 

(b) growing fertile plants from said 
transformed plant cell; and 
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(c) screening progeny plants for the desired 
levels of raffinose saccharides. 

A further aspect of the invention is a method for 
producing galactinol. This method involves 
5 (a) transforming £. with the chimeric 

gene of Claim 5; 

(b) growing said £• £Oli under conditions 

suitable for expression of galactinol synthase; 

(c) isolating a galactinol synthase enzyme 
10 from the £. coli of step (b) ; and 

(d) contacting said isolated galactinol 
synthase enzyme with appropriate substrates in order to 
produce galactinol. 

A further aspect of the invention is another method 
15 for producing galactinol. This method involves 

(a) isolating a galactinol synthase enzyme 

from a plant; and 

(b) contacting said isolated glactinol 
synthase enzyme with appropriate substrates in order to 

20 produce galactinol. 

An aspect of this invention is a method of RFLP 
breeding to produce altered levels of raffinose 
saccharides and sucrose traits in soybeans. This method 
involves 

25 (a) crossing between two soybean varieties 

differing in the traits for raffinose saccharides and 

sucrose levels; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 

30 progeny plants resulting from the cross of (a) ; and 

(c) hybridizing the Southern blot with the 
radiolabeled nucleic acid fragment comprising a 
nucleotide sequence encoding plant galactinol synthase. 

A further aspect of this invention is a method of 
35 varying the level of D-galactose containing 
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oligosaccharides or sucrose in plants in response to 
end-user requirements. This method involves 

(a) combining the nucleic acid fragment of 
SEQ ID NO: 5 or 6 with suitable regulatory sequences for 

5 expression and localization in plant tissues; 

(b) transforming a plant cell with the 
product of step (a) ; 

(c) regenerating plants from said transformed 
plant cell of step (b) to obtain mature plants; 

10 (d) screening the seeds of the plants of step 

(c) for the desired variation in amino acid level. 

BRIEF DESCRIPTION OF THE SEQ UENCE LTSTTNflS 
SEQ ID NOs:l - 4 show the nucleotide sequences far 
four degenerate oligonucleotides used in PCR of zucchini 
15 cDNA. 

SEQ ID NO: 5 shows the nucleotide sequence of 
zucchini leaf galactinol synthase cDNA. The nucleotide 
1 is the first nucleotide following the Eco Rl 
restriction site used in cloning the cDNA, reading from 

20 5' to 3' on the cDNA insert, and nucleotide 1265 is the 
last nucleotide of the cDNA insert, immediately before 
the first nucleotide of the Xhol restriction cloning 
site of plasmid p812. Nucleotides 1 to 32 are the 5' 
untranslated sequence, nucleotides 33 to 35 are the 

25 translation initiation codon, nucleotides 1023 to 1025 
are the termination codon, and nucleotides 1026 to 1265 
are the 3* untranslated sequence. 

SEQ ID NO: 6 shows the nucleotide sequence of 
soybean seed galactinol synthase cDNA. The nucleotide 1 

30 is the first nucleotide following the Pst 1 restriction 
site, reading from 5' to 3 1 on the cDNA insert, 
nucleotide 1386 is the last nucleotide of the cDNA 
insert, immediately before the first nucleotide of the 
Kpn 1 restriction site of plasmid pS21. Nucleotides 1 

35 to 138 are the 5' untranslated sequence, nucleotides 13 9 
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to 141 are the translation initiation codon, nucleotides 
1123 to 1125 are the termination codon, and nucleotides 
1126 to 1386 are the 3' untranslated sequence. 

SEQ ID NOs:7-10 show amino acid sequences of 
5 peptide fragments from zucchini leaf galactinol 

synthase. SEQ ID NOs:7-9 are internal peptide fragments 
generated via CNBr cleavage. SEQ ID NO: 10 is the amino 
acid sequence of the N-terminus of the .galactinol 
synthase holoenzyme. 
10 SEQ ID NO: 11 shows the 5* junction between the 

vector and insert sequences of p812. 

SEQ ID NO: 12 shows converted 5 1 junction between 
vector and insert sequences of p8125, resulting in an 

open reading frame. 
15 nRTATLED D ESCRIPTION OF THE INVENTION 

The present invention describes two nucleic acid 

fragments that encode soybean seed and zucchini leaf 

galactinol synthase. These enzymes catalyze the 

synthesis of galactinol. The only known function of 

20 galactinol is as the galactosyl donor in raffinose 

saccharide biosynthesis. Transfer of either of these 
nucleic acid fragments or portions thereof that encode a 
functional enzyme with suitable regulatory sequences 
into a living cell will result in the production or 

25 over-production of galactinol synthase. 

Transfer of the nucleic acid fragment encoding 
galactinol synthase in soybean into a soybean plant with 
suitable regulatory sequences that transcribe the 
antisense RNA complementary to the mRNA f or its 

30 precursor, will result in the inhibition of the 

expression of the endogenous galactinol synthase gene 
and, consequently, in reduced amounts of galactinol 
synthase relative to the untransformed soybean. 

Following transformation of an appropriate cell 

35 host such as £- coli with the instant nucleic acid 
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fragments, galactinol synthase can then be overexpressed 
relative to the untransformed host cell. Subsequent 
isolation and contact of the enzyme with the appropriate 
substrates results in the production of galactinol. 
5 The instant soybean nucleic acid fragments can also 

be used as restriction fragment length polymorphism 
(RFLP) markers in soybean genetic studies and breeding 
programs . 

In the context of this disclosure , a number of 

10 terms shall be utilized. As used herein, the term 

"nucleic acid" refers to a large molecule which can be 
single stranded or double stranded, composed of monomers 
(nucleotides) containing a sugar, phosphate and either a 
purine or pyrimidine. A "nucleic acid fragment" is a 

15 fraction of a given nucleic acid molecule. In higher 
.plants, deoxyribonucleic acid (DNA) is the genetic 
material while ribonucleic acid (RNA) is involved in the 
transfer of the information in DNA into proteins. A 
"genome" is the entire body of genetic material 

20 contained in each cell of an organism. The term 

"nucleotide sequence" refers to a polymer of DNA or RNA 
which can be single- or double-stranded, optionally 
containing synthetic, non-natural or altered nucleotide 
bases capable of incorporation into DNA or RNA polymers. 

25 As used herein, the term "homologous to" refers to the 
complementarity between the nucleotide sequence of two 
nucleic acid molecules or between the amino acid 
sequences of two protein molecules. Estimates of such 
homology are provided by either DNA-DNA or DNA-RNA 

30 hybridization under conditions of stringency as is well 
understood by those skilled in the art [as described in 
Hames and Higgins, Eds. (1985) Nucleic Acid 
Hybridisation, IRL Press, Oxford, U.K.] ; or by the 
comparison of sequence similarity between two nucleic 

35 acids or proteins. As used herein, "substantially 
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homologous" refers to nucleic acid molecules which 
require less stringent conditions of hybridization than 
those for homologous sequences, and coding DNA sequence 
which may involve base changes that do not cause a 
5 change in the encoded amino acid, or which involve base 
changes which may alter an amino acid, but not affect 
the functional properties of the protein encoded by the 

DNA sequence. 

Thus, the nucleic acid fragments described herein 

10 include molecules which comprise possible variations of 
the nucleotide bases derived from deletion, 
rearrangement, random or controlled mutagenesis of the 
nucleic acid fragment, and even occasional nucleotide 
sequencing errors so long as the DNA sequences are 

15 substantially homologous . 

"Gene" refers to a nucleic acid fragment that 
expresses a specific protein, including regulatory 
sequences preceding (5 l non-coding) and following (3 f 
non-coding) the coding region. "Galactinol synthase 

20 gene" refers to a nucleic acid fragment that expresses a 
protein with galactinol synthase activity. "Native" 
gene refers to the gene as found in nature with its own 
regulatory sequences. "Chimeric gene" refers to a gene 
that comprises heterogeneous regulatory and coding 

25 sequences. "Endogenous" gene refers to the native gene 
normally found in its natural location in the genome. A 
"foreign" gene refers to a gene not normally found in 
the host organism but that is introduced by gene 
transfer. 

30 "Coding sequence" refers to. a DNA sequence that 

codes for a specific protein and excludes the non-coding 
sequences. It may constitute an "uninterrupted coding 
sequence" (i.e., lacking an intron, such as in a cDNA) 
or it may include one or more introns bounded by 

35 appropriate splice junctions. An "intron" is a sequence 
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of RNA which is transcribed in the primary transcript 
but which is removed through cleavage and re-ligation of 
the RNA within the cell to create the mature mRNA that 
can be translated into a protein. 
5 "Initiation codon" and "termination codon" refer to 

a unit of three adjacent nucleotides in a coding 
sequence that specifies initiation and chain 
termination respectively, of protein synthesis (mRNA 
translation) . "Open reading frame" refers to the amino 

10 acid sequence encoded between translation initiation and 
termination codons of a coding sequence. 

"RNA transcript" refers to the product resulting 
from RNA polymerase-catalyzed transcription of a DNA 
sequence. When the RNA transcript is a perfect 

15 complementary copy of the DNA sequence, it is referred 
to as the primary transcript or it may be a RNA sequence 
derived from posttranscriptional processing of the 
primary transcript and is referred to as the mature RNA. 
"Messenger RNA (mRNA) refers to the RNA that is without 

20 introns and that can be translated into protein by the 
cell- "cDNA" refers to a double- stranded DNA that is 
complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that include the mRNA. 
"Antisense RNA" refers to a RNA transcript that is 

25 complementary to all or part of a target primary 

transcript or mRNA and that blocks the expression of a 
target gene by interfering with the processing, 
transport and/or translation of its primary transcript 
or mRNA. The complementarity of an antisense RNA may be 

30 with any part of the specific gene transcript, i.e., at 
the 5' non-coding sequence, 3 1 non-coding sequence, 
introns, or the coding sequence. In addition, as used 
herein, antisense RNA may contain regions of ribozyme 
sequences that may increase the efficacy of antisense 

35 RNA to block gene expression. "Ribozyme" refers to a 
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catalytic RNA and includes sequence-specific 
endoribonucleases . 

As used herein, "suitable regulatory sequences" 
refer to nucleotide sequences located upstream (5 f ), 
5 within, and/or downstream (3 f ) to a coding sequence, 

which control the transcription and/or expression of the 
coding sequences, potentially in conjunction with the 
protein biosynthetic apparatus of the cell. In 
artificial DNA constructs regulatory sequences can also 
10 control the transcription and stability of antisense 
RNA. 

"Promoter" refers to a DNA sequence in a gene, 
usually upstream (5') to its coding sequence, which 
controls the expression of the coding sequence by 

15 providing the recognition for RNA polymerase and other 
factors required for proper transcription. In 
artificial DNA constructs, promoters can also be used to 
transcribe antisense RNA. Promoters may also contain 
DNA sequences that are involved in the binding of 

20 protein factors which control the effectiveness of 

transcription initiation in response to physiological or 
developmental conditions. It may also contain enhancer 
elements. An "enhancer" is a DNA sequence which can 
stimulate promoter activity. It may be an innate 

25 element of the promoter or a heterologous element 

inserted to enhance the level and/or tissue-specificity 
of a promoter. "Constitutive promoters" refers to those 
that direct gene expression in all tissues and at all 
times. "Tissue-specific" or "development-specific" 

30 promoters as referred to herein are those that direct 
gene expression almost exclusively in specific tissues, 
such as leaves or seeds, or at specific development 
stages in a tissue, such as in early or late 
embr yogene sis, re spect ively . 
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The term "expression"/ as used herein, is intended 
to mean the production of a functional end-product. In 
the case of expression or overexpression of the 
galactinol synthase genes it involves transcription of 
5 the gene and translation of the mRNA into precursor or 
mature galactinol synthase proteins. In the case of 
antisense inhibition it refers to the production of 
antisense RNA transcripts capable of preventing the 
expression of the target protein. "Overexpression" 
10 refers to the production of a gene product in transgenic 
organisms that exceeds levels of production in normal or 
non- trans formed organisms. 

As used herein, "total a-galactoside" content 
refers to all a-linked carbohydrate that is soluble in 

15 the solvent system described herein and is capable of 
being assayed using the a-galactosidase/galactose 

dehydrogenase method described herein. "Total raff inose 
saccharides" refers to the a-galactose content present 
in the sum of stachyose (2 mol a-galactose/mol) , 
20 raff inose (1 mol Ot-galactose/mol) and galactinol (1 mol 
a-galactose/mole) as determined by methods described 
herein. 

The "3 1 non-coding sequences" refers to the DNA 
sequence portion of a gene that contains a 

25 polyadenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. 
The polyadenylation signal is usually characterized by 
affecting the addition of polyadenylic acid tracts to 
the 3 1 end of the mRNA precursor . 

30 "Transformation" herein refers to the transfer of a 

foreign gene into the genome of a host organism and its 
genetically stable inheritance. "Restriction fragment 
length polymorphism" (RFLP) refers to different sized 
restriction fragment lengths due to altered nucletode 

35 sequences in or around variant forms of genes . 
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"Edible legumes" herein refers to the major edible 
leguminuous crops including beans (FhaseolUS sp.), chick 
pea ( Cicer aHfttinum ) , cowpea (Yigna unguiculata) , mung 
bean ( Vicrna radiata ) , peas (£isum sativum) r lentil (LsilS. 
5 culinaris ) and lupine (Lupines sp.) . 

"Coniferous horticultural species" herein refers to 
Fraser fir r Abies fraseri (Pursh) Poir.] and white pine 

(EiniLS. strpbus) • 

Purification of Zuc chini Leaf Galactinol Synthase 
10 Galactinol synthase was purified to a 35 kD doublet 

when analyzed by SDS polyacrylamide gel electrophoresis 
(SDS-PAGE) starting from the soluble fraction of 
extracts made from zucchini leaves- The extract was 
filtered,, chromatographically separated on a DE-52 
15 cellulose column , ammonium sulfate precipitated, 

chromatographically separated over Phenyl-Sepharose, 
passed over an Spectra/Gel® AcA 54 gel filtration column 

(Spectrum) and separated via a Mono-Q column 
(Pharmacia) . In a typical preparation, the galactinol 
20 synthase activity was purified about 800-fold. The 
individual 35 kD bands visualized by SDS-PAGE were 
separated via Mono-Q chromatography. Activity was 
detected from both peptides. The individual peptides 
were concentrated using an Amicon Centricon® and 

25 analyzed for amino acid composition. Results from this 
analysis indicated very similar if not identical 
composition. Approximately 700 \lg of the 35 kD doublet 
was cut from a gradient polyacrylamide gel (10-20% 
acrylamide) following electrophoresis. The protein was 

30 sent for antibody production in New Zealand White 

rabbits by Hazelton Research Products Inc., Denver, PA. 
Cloning of the Zucchin i Leaf Galactinol Synthase cDNA 
Both N-terminal and internal amino acid sequence 
information was obtained for the zucchini leaf 

35 holoenzyme. The internal sequence information consisted 
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of three peptides obtained from three fractions 
separated using a Vydac C4 re versed-phase column , 
following Mono-Q. These fractions were named peaks A, 
B, and C. Two of the peptides differed by a stretch of 
5 four additional residues which was present only in the 
peptide in peak B. Hence, only the amino acid sequence 
from peaks A and C were used for oligonucleotide design. 

The amino acid sequence information obtained for 
the individual peptides was used to design 

10 oligonucleotides for use as polymerase chain reaction 

(PCR) primers. However, the order of peptides relative 
to each other in the protein was not determined. Hence, 
primers for sense and antisense orientation were 
required for both peak A and peak C. Four degenerate 

15 oligonucleotides were designed with restriction enzyme 
sites at the ends to facilitate cloning of the 
product (s) into a suitable vector. 

cDNA for the PCR was generated via the action of 
reverse transcriptase on poly A+ RNA previously isolated 

20 from zucchini leaves. Using oligo (dT) 15 primer, a 
collection of cDNAs was synthesized representing the 
various lengths of mRNAs in the sample. This cDNA was 
then used as template DNA for PCR employing combinations 
of the degenerate oligonucleotides. The PCR using the 

25 peak A "sense" oligonucleotide and the peak C 

"antisense" oligonucleotide resulted in a 302 base pair 
(bp) product as determined by DNA sequencing following 
cloning into a bacterial expression vector (p300) . The 
fragment contained both peptide sequences from peaks A 

30 and C, plus, each preceeding Met where CNBr cleavage 
occurred, and the excluded nucleotides which coded for 
known amino acids. The lie in peak A, however, was 
actually a Met in the PCR product. This DNA insert was 
radiolabeled and used to screen a cDNA expression 

35 library made in lambda Uni-ZAP vector (Stratagene) from 
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poly A+ RNA isolated from young zucchini leaves. Two 
positively-hybridizing plaques were subjected to plaque 
purification. Sequences of the pBluescript (Stratagene) 
vector, including the cDNA insert s, from each of the 
5 purified phage stocks were excised in the presence of a 
helper phage and the resultant phagmids used to infect 
£. coli cells resulting in double-stranded plasmids, 
pl81 and pl82. The plasmid, pl82, contained an 842 bp 
EcoRl-Xhol insert and, when translated, coded for the 

10 stretch of amino acids found from p300, at the 5» end of 
the insert. The 681 bp EcoRl-Hindlll fragment of this 
clone was radiolabeled and used to rescreen the 
zucchini library. Thirty positively-hybridizing plaques 
were identified, ten of which were carried on for 

15 further purification and phagemid excision: p411, p711, 
P 812, P 821, pllll, pl!21, p!211, pl221, pl911. The cDNA 
insert in plasmid p812 is flanked by an EcoRl and a Xhol 
cloning site introduced by the cDNA construction, at the 
5* and 3* end, respectively. The 1265 bp nucleotide 

20 sequence encodes a 351 amino acid open reading frame 
that includes the N-terminal sequence found in the 
purified protein at the eleventh amino acid of the open 
reading frame. A fusion protein comprising the first 16 
amino acids of B-galactosidase and beginning at the 

25 first Met (the eleventh amino acid) of the zucchini leaf 
galactinol synthase in an appropriate plasmid is 
expressed in £. coli and is catalytically functional. 
nionina o^ Snvbgan £*gd Galactinol Synthase cDNA 
The 1265 bp insert from the zucchini galactinol 

30 synthase cDNA, p812, was radiolabeled and used to 

screen a cDNA expression library made in lambda Uni-ZAP 
vector (Stratagene) from poly A + RNA isolated from 
developing soybean seeds (Wye cv.). Ten positively- 
hybridizing plaques were purified and the phagemids 

35 excised. Restriction digest mapping indicated that one 
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of these f pS21, contained an insert of a size expected 
for the gene. The cDNA insert (SEQ ID NO: 6) was 1386 bp 
in length with a 987 bp coding sequence; the first 138 
bp of the clone was 5' non-coding sequence. The insert 
5 encodes an open reading frame of 375 amino acids with 
the start Met as residue 47. 

The Bluescript vector sequence (Stratagene) , 5' to 
the cDNA insert, was mutagenized to place the pS21 
coding sequence (SEQ ID NO: 6) in frame with the 
10 B-galactosidase gene r thus allowing for the expression 
of a functional galactinol synthase fusion protein in £. 

A comparison between the entire amino acid sequence 
from the coding regions of the zucchini and soybean 

15 cDNAs revealed 74% identity and 83% identity between the 
internal 288 amino acids. 

The fragment of the instant invention may be used, 
if desired, to isolate substantially homologous 
galactinol synthase cDNAs and genes, including those 

20 from plant species other than soybean. Isolation of 
homologous genes is well-known in the art. 

The nucleic acid fragment of the instant invention 
encoding galactinol synthase, or a coding sequence 
derived from other cDNAs or genes for the enzyme, with 

25 suitable regulatory sequences, can be used to 

overexpress the enzyme in transgenic soybean as well as 
other transgenic species . Such a recombinant DNA 
construct may include either the native galactinol 
synthase gene or a chimeric gene. One skilled in the 

30 art can isolate the coding sequences from the fragment 
of the invention by using and/or creating sites for 
restriction endonucleases, as described in Sambrook et 
al. [Molecular Cloning: A Laboratory Manual 2nd Ed. 
(1989) Cold Spring Harbor Laboratory Press] . Of 

35 particular utility are sites for Nco I (5 f -CCATGG-3 1 ) 
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and Sph I (5 '-GCATGC-3 f ) that allow precise removal of 
coding sequences starting with the initiating codon ATG. 
For isolating the coding sequence of galactinol synthase 
from p812, an Nco I site can be engineered by 
5 substituting nucleotides at positions 31 and 32 with C, 
and nucleotide T at position 36 with G. Cutting at this 
engineered site along with cuts at restriction 
endonuclease sites near the 3* end of p812 such as the 
Sph 1 at 1169 allows removal of the fragment encoding 

10 the galactinol synthase protein and directional re- 
insertion into a properly designed vector. 

Antisense RNA has been used to inhibit plant target 
genes in a dominant and tissue-specific manner [see 
van der Krol et al. (1988) Gene 72:45-50; Ecker et al. 

15 (1986) Proc. Natl. Acad. Sci. USA 83:5372-5376; 

van der Krol et al. (1988) Nature 336:866-869; Smith et 
al. (1988) Nature 334:724-726; Sheehy et al. (1988) 
Proc. Natl. Acad. Sci. USA 85:8805-8809; Rothstein 
et al. (1987) Proc. Natl. Acad. Sci. USA 84:8439-8443; 

20 Cornelissen et al. (1988) Nucl. Acids Res. 17:833-843; 
Cornelissen (1989) Nucl. Acid Res. 17:7203-7209; Robert 
et al. (1989) Plant Mol. Biol. 13:399-409]. 

The use of antisense inhibition of the seed enzyme 
would require isolation of the coding sequence for genes 

25 that are expressed in the target tissue of the target 

plant. Thus, it will be more useful to use the fragment 
of the invention to screen seed-specific cDNA libraries, 
rather than genomic libraries or cDNA libraries from 
other tissues, from the appropriate plant for such 

30 sequences. Moreover, since there may be more than one 

gene encoding seed galactinol synthase, it may be useful 
to isolate the coding sequences from the other genes 
from the appropriate crop. The genes that are most 
highly expressed are the best targets for antisense 

35 inhibition. The level of transcription of different 
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genes can be studied by known techniques, such as run- 
off transcription. 

For expressing antisense RNA in soybean seed from 
the fragment of the invention, the entire fragment of 
5 the invention may be used* There is evidence that the 
3 1 non-coding sequences can play an important role in 
antisense inhibition [Ch'ng et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:10006-10010]. There have also been 
examples of using the entire cDNA sequence for antisense 

10 inhibition [Sheehy et al. (1988) Proc. Nat'l. Acad. Sci. 
USA 85:8805-8809] . 

A preferred class of heterologous hosts for the 
expression of the coding sequence of galactinol synthase 
or the antisense RNA are eukaryotic hosts, particularly 

15 the cells of higher plants including soybean ( Glycine 
max), zucchini, rapeseed (canola) , cotton, edible 

legumes, sugar beet, coniferous horticultural species, 
Stachys maize and tobacco. Particularly preferred among 
the higher plants are soybean ( Glycine max ) , rapeseed 
20 ( Brassica na^jlS.) and other edible legumes. Expression 
in plants will use regulatory sequences functional in 
such plants. 

The expression of foreign genes in plants is well- 
established [De Blaere et al. (1987) Meth. Enzymol. 

25 153:277-291]. The origin of promoter chosen to drive 
the expression of the coding sequence or the antisense 
RNA is not critical as long as it has sufficient 
transcriptional activity to accomplish the invention by 
increasing or decreasing, respectively, the level of 

30 translatable mRNA for galactinol synthase in the desired 
host tissue. Preferred promoters include strong 
constitutive plant promoters, such as those directing 
the 19S and 35S transcripts in cauliflower mosaic virus 
[Odell et al. (1985) Nature 313:810-812; Hull and Howell 

35 (1987) Virology 86:482-493], and tissue or 
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developmentally-specific promoters such as those for the 
small subunit of ribul'ose 1, 5-bisphosphate carboxylase 
[Morelli et al. (1985) Nature 315:200; Broglie et al. 
(1984) Science 224:838; Hererra-Estrella et al. (1984) 
5 Nature 310:115; Coruzzi et al. (1984) EMBO J. 3:1671; 

Faciotti et al. (1985) Bio/Technology 3:241], maize zein 
protein [Matzke et al. (1984) EMBO J. 3:1525], and 
chlorophyll a/b binding protein [Lampa e t al. (1986) 
Nature 316:750-752] . 

10 Specific timing of expression may be desirable. 

Chemically-inducible promoters [Hershey et al. (1990) 
WO 90/11361] allow the expression of a particular 
transgene at a specific stage of development. 

Depending upon the application, it may be desirable 

15 to select promoters that are not constitutive but 

specific for expression in one or more tissues of the 
plant. Such examples include the light-inducible 
promoters of the small subunit of ribulose 1,5- 
bisphosphate carboxylase, if the expression is desired 

20 in photosynthetic tissues, or seed-specific promoters. 

Particularly preferred promoters are those that 
allow seed-specific expression. This may be especially 
useful, since the seed is the portion of the plant in 
which the utility of the invention would be 

25 demonstrated, i.e., as low raffinose saccharide- 

containing animal feed, and also since seed-specific 
expression will avoid any potential deleterious effect 
in non-seed tissues. Examples of seed-specific 
promoters include but are not limited to the promoters 

30 of seed storage proteins, which can represent up to 90% 
of total seed protein in many plants. The seed storage 
proteins are strictly regulated, being expressed almost 
exclusively in seeds in a highly tissue-specific and 
stage-specific manner [Higgins et al. (1984) Ann. Rev. 

35 Plant Physiol. 35:191-221; Goldberg et al. (1989) Cell 
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56:149-160]. Moreover, different seed storage proteins 
may be expressed at different stages of seed 
development . 

Expression of seed-specific genes has been studied 
5 in great detail [see reviews by Goldberg et al. (1989) 
Cell 56:149-160 and Higgins et al. (1984) Ann. Rev. 
Plant Physiol. 35:191-221]. There are currently 
numerous examples for seed-specific expression of seed 
storage protein genes in transgenic dicotyledonous 
10 plants. These include genes from dicotyledonous plants 
for bean fi-phaseolin [Sengupta-Gopalan et al. (1985) 
Proc. Natl. Acad. Sci. USA 82:3320-3324; Hoffman et al. 

(1988) Plant Mol. Biol. 11:717-729] , bean lectin 
[Voelker et al. (1987) EMBO J. 6: 3571-3577] , soybean 

15 lectin [Okamuro et al. (1986) Proc. Natl. Acad. Sci. USA 
83: 8240-8244], soybean Kunitz trypsin inhibitor [Perez- 
Grau et al. (1989) Plant Cell 1:1095-1109], soybean B- 
conglycinin [Beachy et al. (1985) EMBO J. 4:3047-3053; 
Barker et al. (1988) Proc. Natl. Acad. Sci. USA 85:458- 

20 462; Chen et al. (1988) EMBO J. 7:297-302; Chen et al. 

(1989) Dev. Genet. 10:112-122; Naito et al. (1988) Plant 
Mol. Biol. 11:109-123], pea vicilin [Higgins et al. 
(1988) Plant Mol. Biol. 11:683-695], pea convicilin 
[Newbigin et al. (1990) Planta 180:461], pea legumin 

25 [Shirsat et al (1989) Mol. Gen. Genetics 215:326]; 

rapeseed napin [Radke et al. (1988) Theor. Appl. Genet. 
75:685-694] as well as genes from monocotyledonous 
plants such as for maize 15-kD zein [Hoffman et al. 
(1987) EMBO J. 6:3213-3221], and barley B-hordein 

30 [Marris et al. (1988) Plant Mol. Biol. 10:359-366] and 
wheat glutenin [Colot et al. (1987) EMBO J. 6:3559- 
3564]. Moreover, promoters of seed-specific genes 
operably linked to heterologous coding sequences in 
chimeric gene constructs also maintain their temporal 

35 and spatial expression pattern in transgenic plants. 
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Such examples include fiopsis thaliana 2S seed 

storage protein gene promoter to express enkephalin 
peptides in .AirabidPPSiS and fi. napUS seeds 
[Vandekerckhove et al. (1989) Bio/Technology 7:929-932], 
5 bean lectin and bean B-phaseolin promoters to express 
lucif erase [Riggs et ai. (1989) Plant Sci. 63: 47-57] , 
and wheat glutenin promoters to express chloramphenicol 
acetyl transferase [Colot et al. (1987) EMBO J. 6:3559- 
3564] . 

10 Of particular use in the expression of the nucleic 

acid fragment of the invention will be the heterologous 
promoters from several extensively-characterized soybean 
seed storage protein genes such as those for the Kunitz 
trypsin inhibitor [Jofuku et al. (1989) Plant Cell 

15 1:1079-1093; Perez-Grau et al. (1989) Plant Cell 1:1095- 
.1109], glycinin [Nielson et al. (1989) Plant Cell 1:313- 
328], B-conglycinin [Harada et al. (1989) Plant Cell 
1:415-425]. Promoters of genes for a- and B-subunits of 

soybean B-conglycinin storage protein will be 

20 particularly useful in expressing the mRNA or the 

antisense RNA to galactinol synthase in the cotyledons 
at mid- to late-stages of seed development [Beachy et 
al. (1985) EMBO J. 4:3047-3053; Barker et al. (1988) 
Proc. Natl. Acad. Sci. USA 85:458-462; Chen et al. 

25 (1988) EMBO J. 7:297-302; Chen et al. (1989) Dev. Genet. 
10:112-122; Naito et al. (1988) Plant Mol. Biol. 11:109- 
123] in transgenic plants, since: a) there is very 
little position effect on their expression in transgenic 
seeds, and b) the two promoters show different temporal 

30 regulation: the promoter for the a-subunit gene is 

expressed a few days before that for the B-subunit gene. 

Proper level of expression of galactinol synthase 
mRNA or antisense RNA may require the use of different 
chimeric genes utilizing different promoters. Such 

35 chimeric genes can be transfered into host plants either 
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together in a single expression vector or sequentially- 
using more than one vector. 

It is envisioned that the introduction of enhancers 
or enhancer-like elements into either the native 
5 galactinol synthase promoter or into other promoter 

constructs will also provide increased levels of primary 
transcription for antisense RNA or in RNA for galactinol 
synthase to accomplish the inventions . This would 
include viral enhancers such as that found in the 35S 

10 promoter [Odell et al. (1988) Plant Mol. Biol, 10:263- 
272], enhancers from the opine genes [Fromm et al. 
(1989) Plant Cell 1:977-984], or enhancers from any 
other source that result in increased transcription when 
placed into a promoter operably linked to the nucleic 

15 acid fragment of the invention. 

Of particular importance is the DNA sequence 
element isolated from the gene for the a— subunit of 

conglycinin that can confer 40-fold seed-specific 
enhancement to a constitutive promoter [Chen et al. 

20 (1988) EMBO J. 7:297-302; Chen et al. (1989) Dev. Genet. 
10:112-122]. One skilled in the art can readily isolate 
this element and insert it within the promoter region of 
any gene in order to obtain seed-specific enhanced 
expression with the promoter in transgenic plants . 

25 Insertion of such an element in any seed-specific gene 
that is expressed at different times than the JJ- 
conglycinin gene will result in expression in transgenic 
plants for a longer period during seed development. 
Any 3 f non-coding region capable of providing a 

30 polyadenylation signal and other regulatory sequences 
that may be required for the proper expression of the 
galactinol synthase coding region can be used to 
accomplish the invention. This would include the 3' end 
from viral genes such as the 3* end of the 35S or the 

35 19S cauliflower mosaic virus transcripts, the 3' end 
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from the opine synthesis genes , the 3 1 ends of ribulose 
1,5-bisphosphate carboxylase or chlorophyll a/b binding 
protein, or 3' end sequences from any source such that 
the sequence employed provides the necessary regulatory 
5 information within its nucleic acid sequence to result 
in the proper expression of the promoter/galactinol 
synthase coding region combination to which it is 
operably linked. There are numerous examples in the art 
that teach the usefulness of different 3 1 non-coding 

10 regions . 

Various methods of transforming cells of higher 
plants according to the present invention are available 
to those skilled in the art (see EPO publications 
0 295 959 A2 and 0 318 341 Al) . Such methods include 

15 those based on transformation vectors based on the Ti 
and Ri plasmids of Acrrobacterium spp . It is 
particularly preferred to use the binary type of these 
vectors. Ti-derived vectors transform a wide variety of 
higher plants, including monocotyledonous and 

20 dicotyledonous plants, such as soybean, cotton and rape 
[Pacciotti et al. (1985) Bio/Technology 3:241; Byrne 
et al. (1987) Plant Cell, Tissue and Organ Culture 8:3; 
Sukhapinda et al. (1987) Plant Mol. Biol. 8:209-216; 
Lorz et al. (1985) Mol. Gen. Genet. 199:178; Potrykus 

25 (1985) Mol. Gen. Genet. 199:183]. Other transformation 
methods are available to those skilled in the art, such 
as direct uptake of foreign DNA constructs [see EPO 
publication 0 295 959 A2], techniques of electroporation 
[see Fromm et al. (1986) Nature (London) 319:791] or 

30 high-velocity ballistic bombardment with metal particles 
coated with the nucleic acid constructs [see Kline 
et al. (1987) Nature (London) 327:70, and US 4]. Once 
transformed, the cells can be regenerated by those 
skilled in the art. 
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Of particular relevance are the recently described 
methods to transform foreign genes into commercially 
important crops such as rapeseed [see De Block et al. 

(1989) Plant Physiol. 91:694-701], sunflower [Everett et 
5 al. (1987) Bio/Technology 5: 1201] , and soybean [McCabe 

et al. (1988) Bio/Technology 6:923; Hinchee et al. 
(1988) Bio/Technology 6:915; Chee et al. (1989) Plant 
Physiol. 91:1212-1218; Christou et al. (1989) Proc. 
Natl. Acad. Sci USA 86:7500-7504; EPO Publication 

10 0 301 749 A2] . 

The use of restriction fragment length polymorphism 
(RFLP) markers in plant breeding has been well- 
documented in the art [see Tanksley et al. (1989) 
Bio/Technology 7:257-264]. By using the nucleic acid 

15 fragment of the invention as a probe for Southern 

blotting, it could be shown that at least two galactinol 
synthase loci exist in the soybean genome. One of these 
has has been mapped on a soybean RFLP map [Tingey et al. 

(1990) J. Cell Biochem., Supplement 14E p. 291, abstract 
20 R153] . It can thus be used as a RFLP marker for traits 

linked to this mapped locus . This is valuable for 
future mapping and tracking of genes which result in 
agronomically important phenotypes . 

EXAMPLES 

25 The present invention is further defined in the 

following EXAMPLES, in which all parts and percentages 
are by weight and degrees are Celsius, unless otherwise 
stated. It should be understood that these EXAMPLES, 
while indicating preferred embodiments of the invention, 

30 are given by way of illustration only. From the above 
discussion and these EXAMPLES, one skilled in the art 
can ascertain the essential characteristics of this 
invention, and without departing from the spirit and 
scope thereof, can make various changes and 

35 modifications of the invention to adapt it to various 
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usages and conditions. All such modifications are 
intended to fall within the scope of the appended 
claims . 

F.XAMPLE 1 

5 CLQSIHS OF ZUCCHINI T .F.AF GALACTINOL SYNTHASE CDNA 

r^i^tinol Synthase Assay 
Galactinol synthase activity was determined using 
an isotopic assay previously described by Pharr et al. 
[(1981) Plant Sci. Lett, 23:25-33; Handley et al. (1982) 

10 Z. Pflanzenphysiol. Bd 108 S. 447-455] with 

modifications of the reaction mixture and temperature. 
The complete reaction mixture contained enzyme extract 
(15 flL), UDP-{U- 14 C} galactose (0.1 tlmol, 1.0 M£i/jimol) 
(Amersham), myo-inositol (2.0 Jlmol) , MnCl2 (0.1 Jimol) , 

15 dithiothreitol (DTT) , 1.0 Jlmol) and 3- (N-morpholino) - 
. propanesulf onic acid (MOPS)/NaOH buffer at pH 7.0 (5.0 
flmol) , in a total volume of 100 MX. The incubation 
temperature was 25°C, and the reactions were terminated 
after 10 min. ScintiVerse Bio HP (Fisher) scintillation 

20 fluid (10 mL) was added to 300 JlL of the assay mix plus 
enzyme extract and radioactivity in the unbound carbon 
was determined by scintillation counting. 

An additional spectrophotometry assay was 
developed for use in assaying enzyme fractions purified 

25 through the Phenyl-Sepharose (Pharmacia) column. The 
assay mixture consisted of 1 mM UDP-galactose, 20 mM 
myo-inositol, 1 mM MnCl 2 , 25 mM MOPS/NaOH (pH 7.0) 20 
units/mL nucleoside diphosphokinase, 40 units/mL 
pyruvate kinase, 62.5 units/mL lactate dehydrogenase, 

30 0.15 mM NADH, 1 mM ATP and 0.4 mM PEP. All components 
were purchased from Sigma. Comparative assays for each 
sample were routinely done using the above assay mixture 
excluding the myo-inositol. A 1:10 mixture of enzyme 
extract to assay mixture was made and monitored at 340 

35 nm for 5 min at 25°C. 
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Purification of Zucchini Leaf Galan tine! Synth*** 
Leaves from 3-4 week old hybrid zucchini plants 
(500 g, Burpee-"The Gardener's Favorite") were harvested 
and stored at -80°C until use. Frozen leaves were added 
5 to 2000 mL of a buffer consisting of 50 mM MOPS/NaOH (pH 
7.0), 10 mM DTT, 2 mM MnC1.2, and 1 mM phenylmethyl- 
sulfonyl fluoride (PMSF) and ground with a Brinkmann 
Polytron until thawed and homogenized. The homogenate 
was filtered through cheesecloth and precipitated at 4°C 

10 by stirring in 1/3 the volume of 50% PEG 8000 for 20 min 
and centrifuged at 13,000xg for 20 min. The supernatant 
was applied to a 500 mL DE-52 radial flow column 
(Sepragen) . Fractions were eluted with 6 L of a NaCl 
gradient stepped from 0-0.5 M, at a flow rate of 250 

15 mL/min. Fractions with high enzymatic activity were 
.adjusted to 20% saturation with ammonium sulfate, 
applied to a 100 mL Phenyl-Sepharose radial flow column 
and eluted with 300 mL of an ammonium sulfate gradient 
stepping down from 20-0%, followed by 650 mL of the MOPS 

20 buffer described above. Fractions with high activity 

were concentrated as previously described and applied to 
a 2.5 x 90 cm Spectra/Gel® AcA 54 gel filtration column 
(Spectrum) and eluted with 50 mM MOPS/NaOH (pH 7.0), 0.2 
M NaCl, 2 mM DTT at a flow rate of 1 mL/min. Using a 

25 Centricon Centripreps® (10,000) the high-activity 

fractions were concentrated and dialyzed overnight 
against 50 mM MOPS (pH 7.0) and 2 mM DTT at 4°C. The 
sample was then applied to a 5/5 Mono-Q column 
(Pharmacia) and eluted with a 60 mL NaCl gradient 
30 stepping from 0-0.375 M in 50 mM MOPS/NaOH (pH 7.0) and 
2 mM DTT. The high-activity fractions were concentrated 
to 50 \iL using a Amicon Centricon. The samples were 
stored at -80°C in 25% glycerol. 
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y-TR.rminal and Intern al Amino Acid Sequence 

■FT-om the zucch ini L eaf Gal act ing ! Synthase 

The peak fractions of a zucchini leaf prep from 
the Mono-Q column were pooled and concentrated to 1 mL. 
5 The entire sample was loaded onto a Vydac C4 (reversed 
phase) column. The column was eluted with a gradient 
from 100% A buffer (0.1% trifluoroacetic acid (TFA) in 
water) to 40% B buffer (0.1% TFA in acetonitrile) in 5 
min. From 5 to 30 rain the buffer was increased to 50%. 

10 Fractions #21-25 were collected and concentrated to 200 
|IL (from 1 mL) using a nitrogen stream. Two hundred JIL 
of 9 M guanidine-HCl, 1 mM EDTA, 0.375 M Tris (pH 8.5) 
and 4 (IL of 1.4 M B-mercaptoethanol in water were added 
to the concentrate. The solution was incubated at 37°C 

15 for 30 min. Four [XL of 4-vinylpyridine were added and 
the solution was incubated at room temperature for 3 h. 
The solution was then loaded onto a Vydac C4 column and 
eluted with the above gradient. The alkylated 
galactinol synthase peak, which eluted between 16-20 

20 min, was collected and concentrated to 200 \IL as 

described above. Seven hundred \LL of 88% formic acid 
and 200 (XL of 20 mg/mL cyanogen bromide (CNBr) in 70% 
formic acid was added to the sample. The mixture was 
incubated at room temperature overnight, under nitrogen. 

25 The sample was then applied to the Vydac C4 column and 

eluted with a gradient from 100% buffer A for 5 min, and 
then, by increasing buffer B from 0% to 90% for 85 min. 
One mL fractions were collected. Five major peak 
fractions were pooled as follows: Peak A = #31-32, Peak 

30 B = #41, Peak C = #44-45, Peak D = #51, Peak E = #52. 
Each peak was concentrated to 100 \LL, as described 
above. Protein sequence analysis was performed using an 
Applied Biosystems 470A Gas Phase Sequencer. 
Phenylthiohydantoin (PTH) amino acids were analyzed on 

35 an Applied Biosystems 120 PTH Amino Acid Analyzer. 
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Peaks A, B, and C contained sufficient material to 
obtain sequence information. The peptide from peak A 
gave the following sequence from its N-terminal: 
Pro-Asn-Gly-Tyr-Phe-Tyr-Ala-Val-Ile-Asp (SEQ ID NO: 7) . 
5 Peak B gave the following sequence: Tyr-Phe-Asn-Asp- 
Ile-Tyr-Lys-Pro-Ile-Pro-Xaa-Ile-Tyr-Asn (SEQ ID NO: 8) . 
Peak C gave the following sequence: Tyr-Phe-Asn-Asp- 
Ile-Tyr-Lys-Pro-Ile-Pro (SEQ ID NO: 9). The identity of 
the Xaa residue in peak B was not determined, 

10 To obtain the N-terminal sequence of the 

holoenzyme, Mono-Q-purif ied enzyme was loaded onto a 10- 
20% acrylamide gradient gel and subjected to 
electrophoresis. The developed gel was 
electrophoretically blotted onto Immobilon membrane 

15 (Millipore), stained with coomassie blue and destained. 
.The 35 kD band was cut from the Immobilon and the 
N-terminus was determined by gas phase sequencing as 
described above. The entire sequence obtained for the 
fragment is as follows: Pro-Ala-Ala-Thr-Glu-Thr-Ala- 

20 Ile-Giu-Xaa-Thr-Asp-Ala-Pro-Lys-Arg-Ala-Phe-Val (SEQ ID 
NO:10) . 

Clonincr of the Zucchini Leaf Galactinol Synthase sBMfl 
Four degenerate oligonucleotides were designed from 
the amino acid sequences obtained from peak A and peak C 
25 for use as polymerase chain reaction (PCR) primers. The 
degenerate oligonucleotides were designed with flanking 
restriction enzyme recognition sequences and sense and 
antisense orientations were synthesized for each peptide 
sequence : 

30 

Peak A sense 
Peak A antisense 
Peak C sense 
Peak C antisense 

35 
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The oligonucleotides were synthesized on an Applied 
Biosystems 394 DNA/RNA Synthesizer: 

5'-GCGGCCGC ATG CCA AAY GGR TAY TTY TAY GC-3' (SEQ ID NO:l) 

5 Pro Asn Gly Tyr Phe Tyr Ala Val lie Asp (SEQ ID NO: 7) 

3' -ATR AAR ATR CGR CAM TAR CT-5' (SEQ ID NO: 2) 

5'-GCGGCCGC ATG TAY TTY AAY GAY ATH TAY AA-3 * (SEQ ID NO: 3) 

Tyr Phe Asn Asp He Tyr Ly3 Pro He Pro (SEQ ID NO: 9) 
10 3 f -TTR CTR TAR ATR TTY GGW TAR GG-5' (SEQ ID NO: 4) 

A sense M = A,C 

A f antisense W « A,T 

C sense Y = C,T 

15 C antisense R = A, G 

H = A,C,T 

Since it was not known which peptide (A or C) was closer 
to the N terminus of galactinol synthase, two reactions 

20 were run, using either A and C or C and A f . The cDNA 
for the PGR was generated using reverse transcriptase 
(Stratagene) on 5 Jig of poly A+ RNA previously isolated 
from zucchini leaves. Using oligo (dT) 15 primer, a 
collection of cDNAs was synthesized according to the 

25 first strand synthesis procedure described by Promega 
Biotec. PCR was performed using a Perkin-Elmer Cetus 
kit according to the instructions of the vendor on a 
thermocycler manufactured by the same company. When run 
on a 1% agarose gel and stained with ethidium bromide, 

30 the product of the reaction employing oligomers A and C f 
showed a strong DNA band of approximately 300 bp. The 
band was cut from the gel, purified with Gene Clean (Bio 
101 Corp.) according to the vendor's instructions and 
cloned into the pBluescript SK + vector (Stratagene) . 

35 DNA sequencing was done using the appropriate 
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complementary primers and a sequenase kit from United 
States Biochemicals Company according to the vendor's 
instructions. The 302 bp PCR product, when translated, 
coded for peptide sequences from peaks A and C in their 
5 entirety, including the Met residues which preceeded 
each peptide, A radioactive probe was made by random 
priming using 32p- c jCTP and a random priming kit 
purchased from Boehringer Mannheim in order to screen a 
zucchini Uni-ZAP cDNA library. The library was made as 

10 follows: Leaves from the first three nodes of 3-4 week 
old hybrid zucchini plants (Burpee-"The Gardener's 
Favorite") were harvested and stored at -80°C. Frozen 
leaf tissue (20 g) was ground to a fine powder with a 
mortar and pestle. To the tissue, 5 mL of chloroform, 5 

15 mL of vanadyl ribonucleoside complex (Bethesda Research 
Laboratories) and 100 mL of guanidine thiocyanate (GTC) 
reagent [3 M GTC, 10 mM Tris/HCl (pH 8.0), 20 mM EDTA] 
was added. The mixture was centrifuged at 15,000 RPM in 
a Sorval SS34 rotor for 10 min at room temperature. The 

"0 supernatant was then layered over 5 mL pads of 5.7 M 

CsCl (made up in 100 mM EDTA, pH 7.6) in 9/16" x 3-1/2" 
polyallomer ultracentrifuge tubes. The samples were 
centrifuged at 28,000 RPM in a Beckman SW28 rotor for 18 
h at 20°C. The supernatant was removed by aspiration. 

25 The RNA pellets at the bottom of the tubes were 

dissolved in a total of 2 mL of TES buffer [10 mM 
Tris/HCl (pH 7.4), 5 mM EDTA, 1% SDS] . The RNA was 
extracted with an equal volume of chloroform :n-butanol 
(4:1) and precipitated overnight with ethanol, and, 1/10 

30 volume of 3 M sodium acetate. The RNA was washed with 
70% ethanol, dried down and redissolved in 0.5 mL of 
water. Poly A + RNA was isolated using poly (U) Sephadex 
(Bethesda Research Laboratories) according to 
instructions supplied by the vendor. Ten |ig of poly A + 

35 RNA was sent to Stratagene for construction of a lambda 
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Uni-ZAP cDNA library. Titering of the library upon its 
arrival indicated that it contained 1.7 x 10 7 plaque- 
forming units (pfu) /\IL of amplified library stock. 

About 2000 pfu were plated per 80 mm plate on a 
5 total of 20 NZY-agar plates [Sambrook et al. (1989) 
Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold 
Spring Harbor Laboratory Press] giving 40,000 total 
plaques. Plating was done using fi. gpli XLl-Blue 
(Stratagene) grown in NZY +0.2% maltose as the host and 

10 NZY-7.2% agarose as the plating medium. The plaques 
were absorbed onto nitrocellulose filters (Schleicher 
and Schuell, 0.45 mM pore size), in duplicate, denatured 
in 1.5 M NaCl, 0.5 M Tris/HCl (pH 7.5), and rinsed in 2x 
SSC. The filters were blotted on Whatman 3MM paper and 

15 heated in a vacuum oven at 80°C for 2 h to allow firm 
anchorage of phage DNA in the membranes. The 
nitrocellulose filters were screened using the* 
radioactive probe made from the 302 bp PCR fragment. 
After prehybridizing at 50°C in 1% bovine serum albumin 

20 (BSA), 0.5 M NaPi pH 7.2 (NaH2P04 and Na2HP04) , 10 mM 
EDTA, and 7% sodium dodecyl sulfate (SDS) for 4 h, the 
filters were transferred to fresh hybridization mix 
containing the denatured radiolabeled probe and stored 
overnight at 50°C. The filters were rinsed the next day 

25 under stringent conditions: 2x SSC [Sambrook et al. 

(1989) Molecular Cloning: A Laboratory Manual, 2nd Ed. 
Cold Spring Harbor Laboratory Press] at room temp for 15 
min, 2x SSC - 1% SDS at 50°C for 15 min, and lx SSC - 1% 
SDS at 50°C for 15 min. The filters were air-dried and 

30 placed against Kodak XAR-5 film at -70°C. From these 
autoradiograms, twenty hybridizing plaques were 
identified. These plaques were selected from the 
original petri plate and plated out at a dilution to 
yield about 100 plaques per 80 mm plate. These plaques 

35 were absorbed to nitrocellulose filters and reprobed 



WO 93/02196 



37 



PCT/US92/06057 



using the same procedure. After autoradiography, only 
two of the original ten showed plaques . Pure clones 
were isolated and phagemid DNA was excised in the 
presence of helper phage f (pl81, p!82) . DNA from the 
5 plasmids was isolated by the alkaline lysis miniprep 
procedure described in Sambrook et. al. (1989) 
[Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold 
Spring Harbor Laboratory Press] . Restriction 
endonuclease digests of the DNA indicated that the two 

10 clones were very similar. Following sequencing with the 
United States Biochemicals sequence kit, as previously 
described, it was determined that the plasmid p!82 
contained an 842 bp EcoRl-Xhol insert and, when 
translated, coded for the stretch of amino acids found 

15 in the translation of the 302 bp probe, at the 5' end of 
the insert. A 681 bp EcoRl-Hindlll fragment of this 
clone was radiolabelled as previously described and used 
to rescreen 40,000 pfu from the zucchini library. 
Thirty positively-hybridizing plaques were identified, 

20 ten of which were selected for further purification and 
phagemid excision. One of these ten, p812, contained an 
1265 bp nucleotide sequence which is shown in SEQ ID 
NO:5. 

EXAMPLE 2 

25 CLONI NG OF SOYBEAN SE EP SflLACTINOL SYNTHASE cPNA 

HSIKS THE ZUCCHINI GALACTINOL SYNTHASE cDNA CLONE 
The 1265 bp insert of zucchini cDNA, p812 (SEQ ID 
NO: 5), was radiolabelled as previously described, for 
use as a probe to screen the soybean seed cDNA library. 

30 In order to construct a library, poly A + RNA was 

isolated from 27 day old developing soybean seeds (Wye 
cv.) . Ten fig of this RNA was sent to Stratagene for 
construction of a Uni-ZAP cDNA expression library. Upon 
arrival, the library was titered and determined to 

35 contain 1.2 x 10 9 pfu units/p.L of amplified stock. Two 
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thousand pfu were plated as described above and absorbed 
onto nitrocellulose filters. The filters were screened 
using the radioactive probe. Following autoradiography 
of the filters, two positively-hybridizing plaques were 
5 identified. Both were further purified and excision was 
performed in the presense of a helper phage as described 
above. Double-stranded DNA was prepared using the 
alkaline lysis method as previously described. 
Restriction analysis revealed that the two clones, pSll 

10 and pS21, were similar. Sequence analysis of the pS21 

insert, using the US Biochemicals Corp. sequenase kit as 
described above, indicates that although the insert 
unexpectedly lacked the two cloning sites, Eco Rl and 
Xhol, associated with the Uni-ZAP vector, it did contain 

15 a 1386 bp nucleic acid sequence (SEQ ID NO: 6) encoding 
the soybean seed galactinol synthase. 

RX AMPLE 2A 

HONSTRPCTTON OF PL ANT GENE EXPRESSION VECTOR 
CONTAIN!^ SOYBEAN GALACTINOL SYNTHASE GENE 
20 The 1.4kb Kpn I fragment from pS21 was ligated into 

a new pBSK (Stratagene Cloning Systems) vector that had 
been cut with Kpn I and dephosphorylated. Plasmid 
pS21-l was isolated from transformed XL1 blue cells and 
shown to contain the galactinol synthase coding region 
25 in the same orientation as that of the vector-borne 

(3-galactosidase, though it was in a different reading 

frame. pS21-l was digested with BamH I, a unique site 
in the vector, and the resulting 5' overhangs were 
filled in using Klenow fragment in the presence of 1 mM 
30 DNTPs. After ligation and transformation of XL1 blue 
cells, recombinant colonies containing pS21-l-B2 were 
identified that contained the galactinol synthase coding 
region in the correct orientation and the same reading 
frame as |}-galactosidase . 
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Plasmid p35S:GAS was made by removing the 
0-glucuronidase coding region from plasmid pMH40 

(Fig. 1) and replacing it with the galactinol synthase 
coding region from plasmid pS21-l-B2. To this end, 
5 plasmid pS21-l-B2 was digested to completion with the 
restriction endonucleases Pst I and Bfr I. The 5* and 
3' overhanging ends of the resulting DNA fragments were 
rendered blunt by treatment with the Klenow fragment of 
DNA polymerase I in the presence of 1 mM dNTPs. The 
10 blunt-ended DNA fragments were then separated using 

polyacrylamide gel electrophoresis and the 1250 bp DNA 
fragment encoding galactinol synthase was recovered from 
the gel. 

The vector pMH40 was digested to completion with 

15 the restriction endonucleases Xho I and Asp 7181 and the 
resulting 5* overhanging ends were filled in using 
Klenow fragment in the presence of 1 mM dNTPs • " The 
resulting blunt-ended vector was treated with calf 
intestinal alkaline phosphatase and ligated overnight 

20 with an equimolar amount of the 1250 bp galactinol 
synthase DNA fragment isolated above* The 
transformation mixture was diluted ten-fold with TE pH 
7.5 and an aliquot of this dilution was used to 
transform commercially available competent HB101 cells. 

25 Small scale plasmid preparations performed on 

ampicillin-resistant colonies were analyzed by agarose 
gel electrophoresis after digestion with Sal I. 
Plasmids from colonies containing a 3,26 kbp Sal I 
fragment were then digested with Bfr I and Xho I to 

30 determine the orientation of the galactinol synthase 

insert in the plasmids. Plasmids showing a 1.7 kbp band 
upon digestion Bfr I and Xho I contained the galactinol 
synthase insert in a sense orientation with respect to 
the 35S promoter and one was designated p35S:GAS 

35 (Fig. 2) . 
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The entire 35S-galactinol synthase gene was excised 
from p35S:GAS and cloned into the binary vector pZS194b 
(Fig. 3) - The binary vector pZS194b is but one example 
of a large number of binary vectors available that could 
5 be used for this purpose. It contains a left border 
fragment from the octopine Ti plasmid pTiA6 and a right 
border fragment from pTiAch5 (van den Elzen et al., 
Plant Molecular Biology 5: 149-154 f 1985) . The border 
fragments delimit the segment of DNA which becomes 

10 incorporated into the host plant genome during the 

process of AgrroJbacteriujn-mediated transformation. A 
chimeric marker gene (35S/NPT II/OCS) which specifies 
kanamycin resistance in plant cells and the 
p-galactosidase (Lac Z) gene intercepted by the 

15 polylinker sequence of pUC18 is positioned between the 
left and right border fragments. 

The binary vector pZS194b was digested to 
completion with the restriction endonuclease Sal I and 
treated with alkaline phosphatase. Plasmid p35S:GAS was 

20 also digested to completion with Sal I and the digestion 
products were separated by agarose gel electrophoresis. 
The 3.26 kbp 35S:GAS DNA fragment was excised from the 
gel, purified using glass milk and ligated overnight 
with an equimolar amount of Sal I-digested and 

25 phosphate-treated pZS194b. An aliquot of the ligation 
mixture was then used to transform commercially 
available competent HB101 cells using the manufacturer's 
protocol. Aliquots of the transformation mixture were 
plated on LB agar containing 100 ug/mL kanamycin and 

30 grown overnight at 37°C. Small scale plasmid 

preparations made from individual kanamycin-resistant 
colonies were digested with Sal I and analyzed by 
agarose gel electrophoresis. One colony containing a 
plasmid consisting of the 3.26 kbp 35S:GAS fragment from 



WO 93/02196 



41 



PCT/US92/06057 



p35S:GAS ligated into the Sal I site of pZS194b was 
designated pZS194b/35S :GAS ; 3 «NOS . 

The binary plasmid pZS194b/35S :GAS : 3 f NOS was 
mobilized from E. coli HB101 to Agrobacterium strain LBA 
5 4404 (Hoekema, et al., Nature 303: 179-180, 1983) 
through a triparental mating procedure (Ruvkin and 
Ausubel, Nature 289:85-88, 1981) . 

EXAMPLE 3 

EXPRESSION OF ZUCCHINI LEAF 

10 GALACTINOL SYNTHASE IN E , CffiU 

Sequences which are inserted into pBluescript and 
which are direct ionally correct and in- frame with the 
starting methionine of the interrupted fi-galactosidase 
gene borne on the plasmid are capable of being expressed 

15 as fusion proteins consisting of N-terminal amino acids 
of B-galactosidase plus those encoded by the inserted 
sequence. Sequencing of p812 (SEQ ID NO: 5) revealed 
that the cDNA insert was within the EcoRl and Xhol sites 
of that plasmid and was directionally correct but out of 

20 frame with respect to the B-galactosidase sequences . To 
improve the production of galactinol synthase, p812 (SEQ 
ID NO: 5) was modified to place the galactinol synthase 
sequences in-frame with those of B-galactosidase. 

One \Lg of p812 was digested for 1.5 h with 10 1.0. 

25 of Smal and then for an additional 1.5 h with 10 I.D. of 
EcoRI at 37°C. These digests cut the DNA at two places 
within the multiple cloning site of the vector 
Bluescript SK + (Stratagene) and provided a linear 4.1 kb 
fragment which was isolated after electrophoresis on a 

30 0.8% agarose gel. The fragment was incubated with 2 
1.0. of the Klenow fragment of DNA polymerase in the 
presence of 25 JXM dATP and 25 |1M dTTP for 30 min at room 
temperature to fill in the EcoRI half-site. Following 
purification with Gene Clean (Bio 101 Corp.) according 

35 to the manufacturer's instructions, the fragment, now 
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with two blunt ends, was re-circularized by incubating 
with 10 Weiss units of T4 DNA ligase overnight. 
Competent £. £Qlx XL-1 Blue cells (Statagene) were 
transformed according to instructions provided by the 
5 vendor with -50 ng of the ligated plasmid. 

Transformants were selected as ampicillin-resistant 
colonies after overnight growth on LB-agar plates. Five 
colonies were chosen and mini-preparations of plasmid 
DNA were made by the alkaline lysis procedure described 

10 above. Following restriction with either Eco Rl or Sma 
1 and Kpn 1, the plasmids were analyzed by agarose gel 
electrophoresis. This analysis indicated that three of 
the plasmids lacked the Eco Rl and Sma 1 sites of p812 
as anticipated. One of these plasmids r designated 

15 p8125, was chosen for further analysis. Sequencing of 
the 5* junction between the vector and insert sequences 
of p8125 confirmed that the EcoRI and Smal sites had 
been destroyed, and that the original sequence in this 
region of p812, 

20 

5 t- g3£SMECCC ^GCTGCflr? GflATTCG GCA CGAGTGTTGT - 3«, (SEQ ID NO: 11) 

had been converted to 

25 5'- GXGS&lCCeC ^AATTCGGCA CGAGTGTTGT - 3' (SEQ ID NO: 12) 

in p8125 (underlined sequences are from the pBluescript 
vector) . These manipulations created an open reading 
frame encoding a fusion protein consisting of an 
30 additional 46 amino acid N-terminal extension (from the 
B-galactosidase and 5 T non-coding zucchini sequences) 
fused to the 351 amino acid polypeptide of the zucchini 
GaS protein. 

Liquid cultures of transformed £. QQlL XLl-Blue 
35 cell lines containing plasmids p812 or p8125, as well as 
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r 



10 



15 



20 



untransformed XLl-Blue cells , were made by inoculating 5 
itiL of LB media Sambrook et. al. (1989) [Molecular 
Cloning: A Laboratory Manual, 2nd Ed, Cold Spring 
Harbor Laboratory Press] with cells from plated 
colonies. The media for the transformed cells also 
contained 100 flg/mL of ampicillin whereas the media for 
the untransformed cells contained 12 . 5 p,g/mL of 
tetracycline. After overnight growth at 37°C, 0.5 mL of 
the overnight cultures were each diluted into 50 mL of 
LB media containing the appropriate antibiotic, and 
incubation at 37°C was continued. After reaching an 
optical density at 600 nm of 0.6, the cultures were 
split in half. Isopropyl thiogalactoside (IPTG) was 
added to one 25 mL aliquot to give a final concentration 
of 1 mM; the other aliquot was not treated with IPTG. 
Growth of the cultures was continued for 18 h at 37°C, 
at which time the cells were harvested by 
cent rifugat ion, weighed, and frozen at -80°C for 1 h. 
The cells were resuspended in buffer containing 50 mM 
MOPS (pH 7.0) and 2 mM DTT using 5 mL of buffer/g cells. 
The cells were disrupted by probe sonication at 4°C. 
Cell debris was removed by centrifugation, and 5-15 \iL 
portions of the supernatants were assayed for galactinol 
synthase activity in the presence and absence of 
myoinositol using the 14 C-assay described above. 
Myoinositol-dependent galactinol synthase activities of 
the various extracts are shown below: 



Extract 



IPTG 



Net reaction 
llmol/min-q cells 



XLl-Blue 



0 
0 



+ 



p812 /XLl-Blue 



0.04 
0.25 



+ 



p8125/XLl-Blue 



20 
22 
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These results confirm that, although p812- 
containing E- cells produce measurable galactinol 

synthase activity, the in-frame placement of galactinol 
5 synthase in the p8125 clones considerably increased the 
production of active r recombinant enzyme. Western blot 
analysis of the proteins produced by each of the cell 
lines corroborated this finding: p8125-containing cells 
produced peptides detectable by antibodies raised 

10 against zucchini galactinol synthase, but no antibody- 
detectable peptides were produced by plasmid-free or 
p812-containing cell lines. 

Radiolabeled and unlabelled galactinol may be 
produced enzymatically by the action of galactinol 

15 synthase on UDP-galactose and myoinositol as described 
above. The possession of a readily available source of 
large quantities of recombinant galactinol synthase 
improves this synthesis, and will result in greater 
availability of this compound which has been difficult 

20 to obtain heretofore. 

EXAMPLE 4 

QS E OF SOYBEAN fTCRn fiRLAC TTN^ SEQUENCE IN PL&SMIP AS A 
BflF T^TrTTQN FRAGMENT LEN G TH POLYMORPHISM (RFLP) MARKER 

The cDNA insert from plasmid pS21 was removed from 
25 the Bluescript vector by digestion with the restriction 
enzymes Pstl and Kpnl using standard conditions as 
described in Sambrook et al. [Molecular Cloning: A 
Laboratory Manual, 2nd Ed. (1989) Cold Spring Harbor 
Laboratory Press] and labelled with 32 P using a Random 
30 Priming Kit from Bethesda Research Laboratories under 
conditions recommended by the manufacturer. The 
resulting radioactive probe was used to probe a Southern 
blot (Sambrook et al., [Molecular Cloning: A Laboratory 
Manual, 2nd Ed. (1989) Cold Spring Harbor Laboratory 
35 Press] containing genomic DNA from soybean [ Glycine ma* 
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(cultivar Bonus) and glycine soja (PI81762)], digested 
with one of several restriction enzymes. After 
hybridization and washes under standard conditions 
[Sambrook et al,, [Molecular Cloning: A Laboratory 
5 Manual, 2nd Ed. (1989) Cold Spring Harbor Laboratory 
Press], autoradiograms were obtained, and different 
patterns of hybridization (polymorphisms) were 
identified in digests performed with restriction enzymes 
Msp 1 and Bam HI . The same probe was then used to map 

10 the polymorphic pS21 loci on the soybean genome, 

essentially as described by Helentjaris et al. [(1986) 
Theor. Appl. Genet. 72:761-769]. Plasmid pS21 probe was 
applied, as described above, to Southern blots of Msp 1 
or BamHl, digested genomic DNAs isolated from 68 F2 

15 progeny plants resulting from a £. max Bonus x £. soja 
. PI817 62 cross. The bands on the autoradiograms were 
interpreted as resulting from the inheritance of either 
paternal (Bonus) or maternal (PI81762) pattern, or both 
(a heterozygote) . The resulting data were subjected to 

20 genetic analysis using the computer program Mapmaker 
[Lander et al. (1987) Genomics 1: 174-181]. In 
conjunction with previously obtained data for 436 
anonymous RFLP markers in soybean [Tingey et al. (1990) 
J. Cell. Biochem., Supplement 14E p. 291, abstract 

25 R153], position one genetic locus corresponding to the 

pS21 probe was positioned on the soybean genetic map. 

EXAMPLE 5 

PRODUCTION OF SRLftCTINQL USING 
ZUCCHINI LEAF GALACTINOL SYNTHASE 

30 Zucchini leaf galactinol synthase was prepared as 

described in Example 1, except the purification was 
stopped after the Phenyl Sepharose step and the high- 
activity fractions were concentrated to 100 mL and 
frozen. At this stage r the enzyme is of sufficient 

35 purity (1.2 I.U./mg protein) to catalyze the synthesis 
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of galactinol in. vitro from myo- inositol and 
UDP-galactose since interfering enzymatic activities 
have been removed. While UDP-galactose is commercially 
available, it is more expensive than UDP-glucose from 
5 commercially available sources. As a result, it is 

preferable to use UDP-glucose as the initial nucleotide 
sugar substrate and then convert it into UDP-galactose 
by treating it with the enzyme UDPG-4 1 -epimerase 
(EC 5.1.3.2). The pH optimum of UDPG-4 • -epimerase is 

10 higher than that for galactinol synthase so galactinol 
was produced in a two-step reaction. 

A 100-mL reaction mix was prepared that contained 
20 mM N-tristhydroxymethyl]methyl-3-aminopropanesulfonic 
acid (TAPS)/NaOH [8.7], 80 mM UDP-glucose, 200 I.U. 

15 UDPG-4* -epimerase, 80 mM myo-inositol, 2 mM MnCl2, and 
20 mM DTT. The mixture was incubated 10 minutes at 
30°C, and then sufficient N-[2-hydroxyethyl]piperazine- 
N'-[2-ethansulfonic acid] (HEPES) /NaOH and MOPS were 
added to reduce the pH to 7.6 at 30°C. Following the 

20 addition of the buffers, 40 I.U. of galactinol synthase 
was added and the mixture was incubated for 4.5 hours at 
30°C. Following the incubation, the mixture was 
lyophilized. The lyophilized powder was dissolved in 
water and purified by gel filtration on a Toyopearl TSK 

25 HW40F column, 2.6 X 75 cm. The mobile phase used was 
0.04% sodium azide in water at a flow rate of 0.5 
mL/min. Fractions were collected and analyzed for 
galactinol and myo-inositol by High Performance Anion 
Exchange Chromatography (HPAEC) using Pulsed 

30 Amperometric Detection (PAD) . Fractions containing no 
significant contamination by myo-inositol were 
lyophilized while the remainder was repurified on the 
same column. This process was repeated until the 
majority of the galactinol was isolated from the myo- 

35 inositol. Following purification, the sodium azide was 
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removed by passing the solution over a mixed-bed ion- 
exchange column (BioRad) according the vendor's 
instructions. The galactinol was f reeze-dried and 
stored over desiccant at -20°C. Identity of the 
5 galactinol was confirmed by enzymatic hydrolysis using 
a-galactosidase followed by HPAEC/PAD analysis of the 

products . 

EXAMPLE 6 

TRANSFORMATION OF BRASSICA NAPUS CULTIVAR "WESTAR" 
10 WITH 3 5 S-SOYBE AN GALACTINOL SYNTHASE CTNE 

Brassica napus cultivar "Westar" was transformed 
with the 35S-soybean galactinol synthase gene by co- 
cultivation of seedling pieces with disarmed 
Agrobacterium tumefaciens strain LBA4404 carrying the 
15 binary vector pZS194b/35S :GAS :3 'NOS as constructed in 
Example 2A. 

B . napus seeds were sterilized by stirring in 10% 
Chlorox, 0.1% SDS for thirty minutes, and then rinsed 
thoroughly with sterile distilled water. The seeds were 

20 germinated on sterile medium containing 30 mM CaC12 and 
1.5% agar, and grown for six days in the dark at 24°C. 

Liquid cultures of Agrobacteria for plant 
transformation were grown overnight at 28°C in Minimal A 
medium containing 100 mg/L kanamycin. The bacterial 

25 cells were pelleted by cent rifugat ion and resuspended at 
a concentration of 10 8 cells/ml in liquid Murashige and 
Skoog Minimal Organic medium containing 100 uM 
acetosyringone . 

B. napus seedling hypocotyls were cut into 5 mm 

30 segments which were immediately placed into the 

bacterial suspension. After 30 minutes, the hypocotyl 
pieces were removed from the bacterial suspension and 
placed onto BC-12 callus medium containing 100 uM 
acetosyringone. The plant tissue and Agrobacteria were 

35 co-cultivated for three days at 24°C in dim light. 
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10 



The co-cultivation was terminated by transferring 
the hypocotyl pieces to BC-12 callus medium containing 
200 mg/L carbenicillin to kill the Agrobacteria, and 
25 mg/L kanamycin to select for transformed plant cell 
growth. The seedling pieces were incubated on this 
medium for three weeks at 24°C under continuous light. 

After three weeks, the segments were transferred to 
BS-48 regeneration medium containing 200 mg/L 
carbenicillin and 25 mg/L kanamycin. Plant tissue was 
subcultured every two weeks onto fresh selective 
regeneration medium, under the same culture conditions 
described for the callus medium. Putatively transformed 
calli grew rapidly during this phase; as calli reached a 
diameter of about 2mm, they were removed from the 
15 hypocotyl pieces and placed on the same medium lacking 
kanamycin . 

Shoots began to appear within several weeks after 
transfer to BS-48 regeneration medium. As soon as the 
shoots formed discernable stems, they were excised from 
20 the calli, transferred to MSV-1A elongation medium, and 
moved to a 16: 8-hour photoperiod at 24°C. 

Once shoots had elongated several internodes, they 
were cut above the agar surface and the cut ends were 
dipped in Rootone. Treated shoots were planted directly 
25 into wet Metro-Mix 350 soiless potting medium. The pots 
were covered with plastic bags which were removed when 
the plants were clearly growing — after about ten days. 

Plants were grown at 20°C under a 16: 8-hour 
photoperiod. When the primary flowering stem began to 
elongate, it was covered with a mesh pollen-containment 
bag to prevent outcrossing. Self-pollination was 
facilitated by shaking the plants several times each 
day. Seeds derived from self-pollinations were 
harvested about three months after planting. 



30 
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Two putative trans formants, 124-6 and 124-25, were 
verified by Polymerase Chain Reaction and/or Southern 
blot assays as having been transformed by the 
35S-soybean galactinol synthase gene. Seeds from these 
5 plants were subsequently analyzed for soluble 
carbohydrate composition according to Example 7. 



Minimal A Bacterial Growth Medium 
Dissolve in distilled water: 
10 10,5 grams potassium phosphate, dibasic 

4.5 grams potassium phosphate, monobasic 
1.0 gram ammonium sulfate 
0.5 gram sodium citrate, dihydrate 
Make up to 979 mLs with distilled water 
15 Autoclave 

Add 20 mLs filter-sterilized 10% sucrose 
Add 1 mL filter-sterilized 1 M MgS04 
Brassica Callus Medium BC-12 
Per liter: 

20 Murashige and Skoog Minimal Organic Medium (MS 

salts, 100 mg/L i-inositol, 0.4 mg/L thiamine; 

GIBCO #510-3118) 

30 grams sucrose 

18 grams mannitol 
25 1.0 mg/L 2,4-D 

3.0 mg/L kinetin 

0 . 6% agarose 

Brassica Reoperation Medium BS-48 
30 Murashige and Skoog Minimal Organic Medium 

Gamborg B5 Vitamins (SIGMA #G1019) 
10 grams glucose 
250 mg xylose 
600 mg MES 
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0.4% agarose 
pH 5.7 

Filter-sterilize and add after autoclaving: 
2.0 mg/L zeatin 
5 0.1 mg/L IAA 

Brassica Shoot Elongation Medium MSV-1A 

Murashige and Skoog Minimal Organic Medium 
Gamborg B5 Vitamins 
10 grams sucrose 
10 0.6% agarose 

pH 5.8 

KXAMPLE 7 

T pF.WTTFTr.ATTON OF TMPROV T.n P. ARBO H YOR ATE COMPOSITION 
n p PANOT-A LINES 124-6 AND 124-25 
15 Assays fo r Balacfcinol Synthase, RflffinQSe 

^rrhar^fifl. an H Rnlnble Carbohydrates 
Galactinol synthase activity was measured in seed 
of fertile plants that had been transformed with the 
35S: Soybean GaS construction. Seeds were extracted in 
20 buffer (0.25 M N- [2 [Hydroxyethyl] piperizine-N '- [2- 

ethanesulfonic acid], (HEPES) /NaOH at pH 8.0, 10 mM DTT 
and 1 mM MnCl 2 ) in a 5:1 (volume/weight) ratio. For 
each sample, approximately 100 mg of seed tissue was 
used for extraction of the enzyme. Following the 
25 addition of buffer, the seeds were ground twice for 15 
sec using a tissue homogenizer. The seed extract was 
the centrifuged for 5 min at 15,000 g at.2°C. 
Galactinol synthase activity was assayed as described in 
Example 1. 

30 Raffinose saccharide content was determined using 

two distinct assays. Total a-galactoside or raffinose 
saccharide content were determined on an "as is" basis 
for the ground material. Approximately 30 seeds from a 
given plant were ground in a mortar and pestle. 

35 Approximately 30 mg of the resultant powder was weighed 
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into a 13 x 100 ram screw cap tube and 1.6 mL of 
chloroform and 1.5 mL of methanol : water (4:3, v/v) was 
added. The tubes were then capped, placed in racks and 
shaken on a rotary shaker for 150 min at 1800 rpm at 
5 room temperature. After extraction, the contents of the 
tubes were allowed to settle for 15 min. After 
settling, a 15 JIL aliquot of the methanol: water phase 

was placed in a well of a 96 well microtiter plate and 
dried at 45°C for 20 min. At this point the raffinose 
10 saccharide content -was determined in one of two assays . 
The first involved a coupled enzymatic assay that 
employs a-galactosidase and galactose dehydrogenase as 

described previously (Schiweck and Busching (1969) 
Zucker 22:377-384, Schiweck and Busching (1975) Zucker 
15 28:242-243, Raffinose Detection Kit®, Boehringer 

Mannheim GMBH, Catalog Number 428 167) with 
modifications of the assay conditions. The 
modifications of the assay included addition of Bovine 
Serum Albumin (15 mg/mL) to the assay and 

20 a-gaiactosidase buffers, increasing the temperature and 
time of the a-galactosidase incubation from room 
temperature to 45°C and 30 min, and increasing the time 
of the galactose dehydrogenase incubation from 20 min to 
60 min and using stachyose instead of raffinose for the 

25 a-galactoside standard. After incubation, the A3 40 of 
the samples were determined on a BIO-TEK™ Model EL340 
Microplate reader. The amount of a-galactosides present 
in the samples were determined by comparison to known 
quantities of the stachyose standard. 

30 An additional assay was used to eliminate the 

potential for artifacts that could result from the use 
of an enzymatically based assay (e.g., presence of a 
novel inhibitor of a-galactosidase or galactose 

dehydrogenase in the seed) , as well as to obtain more 
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complete information of the individual soluble 
carbohydrates present in the seed, 

A High Performance Anion Exchange Chromatography/ 
Pulsed Amperometric (HPAEC/PAD) assay was used for 
5 determining the content of individual raffinose 
saccharides (e.g., stachyose, raffinose, and 
galactinol) , as well as sucrose, the other major soluble 
sugar present in the seed. Conditions for the grinding 
and extraction of the seed were identical to those used 

10 for the previous assay. A 250 mL aliquot of the aqueous 
phase was removed and dried under reduced pressure at 
75°C. The dried material was then dissolved in 1 mL of 
water, mixed vigorously for 30 sec. A 100 jil aliquot 
was removed and diluted to 1 mL with water. The sample 

15 was mixed thoroughly again and then centrifuged for 

3 min at 10,000 rpm. Following centrifugation, a 20 J1L 
sample was analyzed on a Dionex™ PA1 column using 150 mM 
NaOR at 1.3 mL/min at room temperature. The Dionex™ PAD 
detector was used with Ei=0.05 v, E 2 =0.60 v and 

20 E 3 = -0.60 v and an output range of 1 nA. Galactinol, 
glucose, fructose, sucrose, raffinose, and stachyose 
were well separated by the chromatographic conditions. 
The carbohydrate content of the samples was determined 
by comparison to authentic standards. 

25 Results obtained from the carbohydrate analyses 

were subjected to analysis of variance using the 
software SuperANOVA (Abacus Concepts, Inc., 1984 Bonita 
Avenue, Berkeley, CA 94704). Fisher's Protected LSD 
was used as the post-hoc test for comparison of means. 

30 Using these protocols, two lines with identified 

with higher galactinol synthase activity in the 
harvested seed. Myo-inositol-dependent galactinol 
synthase activity of the lines are shown below: 
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Galactinol Synthase 
Line urool/min/g seed 

124-6 251 
124-25 253 
Westar 77 



The total a-galactoside content and the soluble 
carbohydrate composition of these lines are shown below: 
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The results indicate that seed from lines 124-6 and 
124-25 had about three times greater activity of 
galactinol synthase compared to that observed in seed 
from Westar. Despite the increased amount of galactinol 
5 synthase activity, the total a-galactoside content of 

the transformed lines was significantly less than that 

of Westar. This was confirmed using the HPAEC/PAD 

method as well, and this analysis indicated that the 
lower total a-galactoside content was primarily 

10 attributed to a substantially reduced stachyose content 
in the seed of 124-6 and 124-25, Differences in 
raffinose and galactinol content among the lines were 
not significant at the 0.05 level. In addition to the 
reduction in total a-galactoside and stachyose content, 

15 the sucrose content was higher in 125-6 and 125-25 
compared to that from Westar. Collectively/ these 
results were unexpected in that a reduction in the 
raffinose saccharide content and an increase in the 
sucrose content was not anticipated from transforming 

20 plants with a 35S: soybean GaS construction (SEQ ID 
NO: 6). The mechanism whereby the transformation 
described herein results in canola plants with seeds 
that have an improved carbohydrate composition remains 
to be determined. Nevertheless, a significant 

25 improvement in the soluble carbohydrate composition of 
the seed was observed. 

B RI EF DE SCRIPTION QF THE DRAWINGS 
Figure 1 shows the construction of Plasmid 
p35S:GAS. 

30 Figure 2 shows the construction of p35S:GAS. 

Figure 3 shows the construction of pZS194b. 
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SEQSEBCE listing 
(1) GENERAL INFORMATION: 

(i) APPLICANT: John W. Pierce 

Phillip S. Kerr 
Mary F. Becker-Manley 
(ii) TITLE OF INVENTION: Nucleotide Sequences 

of Galactinol 
Synthase from 
Zucchini and Soybean 
(iii) NUMBER OF SEQUENCES: 12 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : E. I. du Pont de Nemours 

and Company 

(B) STREET: 1007 Market Street 

(C) CITY: Wilmington 

(D) STATE: Delaware 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19898 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 

1.0 MB 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/735,066 

(B) FILING DATE: 24 JULY 1991 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: LINDA AXAMETHY FLOYD 

(B) REGISTRATION NUMBER: 33,692 

(C) REFERENCE /DOCKET NUMBER: BB-1032-A 
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(ix) 



TELECOMMUNICATION INFORMATION: 



(A) TELEPHONE : 

(B) TELEFAX: 

(C) TELEX: 



(302)992-4929 
(302)892-7949 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: In vitro synthesized DNA 
(iii) HYPOTHETICAL: Degenerate oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCGGCCGCAT GCCAAAYGGR TAYTTYTAYG C 31 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: In vitro synthesized DNA 

(iii) HYPOTHETICAL: Degenerate oligonucleotide 

encoding peptide SEQ ID NO: 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCRATMACRG GRTATAARTA 20 



encoding peptide SEQ ID NO: 7 
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INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: In vitro synthesized DNA 

(iii) HYPOTHETICAL: Degenerate oligonucleotide 

encoding peptide SEQ ID NO: 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCGGCCGCAT GTAYTTYAAY GAYATHTAYA A 31 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: In vitro synthesized DNA 

(iii) HYPOTHETICAL: Degenerate oligonucleotide 

encoding peptide SEQ ID NO: 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GGRATWGGYT TRTARATRTC RTT 23 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single 

(D) TOPOLOGY: linear 



WO 93/02196 
(2) 



(ii) MOLECULE TYPE: cDNA to mRNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cucurfritfl r>epo 

(B) STRAIN: Burpee's hybrid 
(F) TISSUE TYPE: leaf 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to mRNA 

(B) CLONE: p812 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GGCACGAGTG TTGTTCCACC CACATAATCA AT 32 

ATG TCT CCG GCT GCC ACC GAA ACT GCC ATC GAG TCG ACC GAT GCC 77 
Met Ser Pro Ala Ala Thr Glu Thr Ala lie Glu Ser Thr Asp Ala 
15 10 15 

CCC AAG AGG GCA TTC GTG ACG TTC TTG GCC GGT AAT GGA GAC TAT 122 
Pro Lys Arg Ala Phe Val Thr Phe Leu Ala Gly Asn Gly Asp Tyr 

20 25 30 

TGG AAA GGT GTG GTT GGA TTG GCA AAG GGT CTC AGA AAG GTC AAG 167 
"Trp Lys Gly Val Val Gly Leu Ala Lys Gly Leu Arg Lys Val Lys 

35 40 ~ 45 

ACC GTC TAC CCT CTC ATT GTA GCT GTC CTG CCT GAT GTT CCC GAA 212 
Thr Val Tyr Pro Leu lie Val Ala Val Leu Pro Asp Val Pro Glu 

50 55 ~ 60 

■ 

GAC CAT CGC CAG ATT CTC GAG TAT CAG GGA TGC ATC GTC CGA GAA 257 
Asp His Arg Gin lie Leu Glu Tyr Gin Gly Cys lie Val Arg Glu 

65 70 75 

ATC GAG CCT GTT TAC CCC CCT GCA AAC CAG ACT CAA TTT GCG ATG 302 
lie Glu Pro Val Tyr Pro Pro Ala Asn Gin Thr Gin Phe Ala Met 

85 90 95 

GCA TAC TAT GTT ATC AAC TAC TCA AAG CTT AGG ATT TGG GAG TTC 347 
Ala Tyr Tyr Val lie Asn Tyr Ser Lys Leu Arg lie Trp Glu Phe 

105 110 " 115 

GTG GAG TAT GAG AAG CTG ATA TAT TTG GAT GGG GAC ATT CAA GTG 392 
Val Glu Tyr Glu Lys Leu lie Tyr Leu Asp Gly Asp lie Gin Val 

120 125 130 

TTT GAG AAC ATC GAT CAT CTG TTC GAA ATG CCA AAT GGA TAC TTC 437 
Phe Glu A3n lie Asp His Leu Phe Glu Met Pro Asn Gly Tyr Phe 

135 140 145 

TAC GCC GTG ATG GAC TGC TTC TGT GAG AAG ACA TGG AGT AAC TCA 482 
Tyr Ala Val Met Asp Cys Phe Cys Glu Lys Thr Trp Ser A3n Ser 

150 155 160 
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CCG CAG TAC AAG ATT GGT TAC TGC CAG CAA TGC CCT GAC AAA GTC 527 
Pro Gin Tyr Lys lie Gly Tyr Cys Gin Gin Cys Pro Asp Lys Val 

165 1™ 175 

AAG TGG CCT GTT GAG GAA ATG GGA AAC CCA CCC CCT CTT TAC TTC 572 
Lvs Trp Pro Val Glu Glu Met Gly Asn Pro Pro Pro Leu Tyr Phe 
Y F 180 185 190 

AAC GCC GGA TTT TTT GTG TAT GAA CCC GAC CTC TTC ACC TAC AAG 617 
Asn Ala Gly Phe Phe Val Tyr Glu Pro Asp Leu Phe Thr Tyr Lys 

195 200 205 

GAT CTT CTC GAA ACT TGC AAG GCC ACC ACT CCA ACC TTG TTT GCT 662 
Aso Leu Leu Glu Thr Cys Lys Ala Thr Thr Pro Thr Leu Phe Ala 

210 215 220 

GAG CAG GAC TTT CTG AAC ATG TAC TTC AAC GAC ATT TAC AAA CCC 707 
Glu Gin Asp Phe Leu Asn Met Tyr Phe Asn Asp lie Tyr Lys Pro 

225 230 235 

ATT CCT CCC ATT TAC AAC CTC GTC ATG GCC ATG TTG TGG CGT CAT 752 
He Pro Pro He Tyr Asn Leu Val Met Ala Met Leu Trp Arg His 

240 245 250 

CCC GAG AAC ATC GAC GTA GAC AAA GTC AAA GTT GTC CAC TAC TGT 797 
Pro Glu Asn He Asp Val Asp Lys Val Lys Val Val His Tyr Cys 

255 260 265 

GCG GCG GGA TCA AAG CCG TGG AGG TAC ACA GGA GAG GAA GAG AAC 842 
Ala Ala Gly Ser Lys Pro Trp Arg Tyr Thr Gly Glu Glu Glu Asn 

270 275 280 

ATG GAC AGA GAA GAC ATA AAA ATG TTG GTG AAG AAA TGG TGG GAG 887 
Met Asp Arg Glu Asp He Lys Met Leu Val Lys Lys Trp Trp Glu 

285 290 295 

GTT TAT GAA GAT GAA TCT TTG GAC TAC CAA AAT GTT ATC AAA TCT 932 
Val Tvr Glu Asp Glu Ser Leu Asp Tyr Gin Asn Val He Lys Ser 

300 305 310 

GAA ACC AAA GAA GCA ACC AAC GTC GCG CCT TTG GTC TCC GTG TTG 977 
Glu Thr Lys Glu Ala Thr Asn Val Ala Pro Leu Val Ser Val Leu 

315 320 325 

TCG GAG GCT GAA GTT GTC AAC CAT ATC ACA GCT CCT TCT GCT GCT 1022 
Ser Glu Ala Glu Val Val Asn His He Thr Ala Pro Ser Ala Ala 

330 335 340 

1025 

TAA 

ATTATATATA CATATATATA TATATATATA TAGATAATAT GTATGAGTGT 1075 

TTGTGGTGTG AGGCCAAATA GTATTATTAT TATTAAGCTT ATTATTATTA 1125 

TGTACGGTAG CTCAGGTGGG GGTTGTTTTG TCCTTTGTGC ATGCAACTCC 1175 

CAACCTTSWT S RYKRCWWYM MKTYWSYWWM CAAATTTTAT GAATAACCCT 1225 
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GCTTTTATGT GTCTTTCTAA AAAAAAAAAA AAAAAAAAAA 1265 



<2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(B) STRAIN: Cultivar Wye 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA to mRNA 

(B) CLONE: pS21 

(x) PUBLICATION INFORMATION: Unpublished 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:" 



GTTTGTTTTC AAAGTGTGTT TTGTTTCCCA AATCCTACTC TTGTGACCAC 50 
AACCCTTCCT CCTCTTTCTT TTGAAACCTC TTTTTTTCTA TTCCCCAACC 100 
AAACAAGCAA ACGCTACTCA CTCATCATCA CTGAGATC 138 



ATG GCT CCT AAT ATC ACC ACT GTC AAA ACC ACC ATC ACC GAC GCT 163 
Met Ala Pro Asn lie Thr Thr Val Lys Thr Thr lie Thr Asp Ala 
15 10 15 

CAA GCC AAG GTC GCC ACC GAT CAT GGT CGT GCC TAC GTC ACC TTC 208 
Gin Ala Ly3 Val Ala Thr Asp His Gly Arg Ala Tyr Val Thr Phe 

20 25 30 

CTC GCC GGA AAC GGT GAC TAT GTG AAA GGT GTC GTT GGC TTG GCA 253 
Leu Ala Gly Asn Gly Asp Tyr Val Lys Gly Val Val Gly Leu Ala 

35 40 45 

AAA GGT CTG AGA AAA GTG AAG AGC ATG TAC CCT CTG GTG GTT GCA 298 
Lys Gly Leu Arg Lys Val Lys Ser Met Tyr Pro Leu Val Val Ala 

50 55 60 

GTG CTA CCC GAT GTT CCC CAA GAT CAC CGC AAC ATT CTC ACC TCC 343 
Val Leu Pro Asp Val Pro Gin Asp His Arg Asn lie Leu Thr Ser 

65 70 75 
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CAA GGT TGC ATT GTT AGA GAG ATT GAG CCC GTG TAC CCC CCA GAG 
Gin Glv Cys lie Val Arg Glu lie Glu Pro Val Tyr Pro Pro Glu 

80 - 85 90 



388 



AAT CAA ACC CAG TTT GCC ATG GCA TAT TAC GTC ATC AAC TAT TCC 433 
Asn Gin Thr Gin Phe Ala Met Ala Tyr Tyr Val lie Asn Tyr Ser 

95 100 105 

AAG CTA CGT ATT TGG GAG TTT GTG GAG TAC AGC AAG ATG ATA TAC 478 
Lys Leu Arg He Trp Glu Phe Val Glu Tyr Ser Lys Met He Tyr 

110 H5 120 

CTA GAC GGT GAT ATC CAA GTT TTT GAC AAC ATT GAC CAC TTG TTT 523 
Leu Asp Gly Asp He Gin Val Phe Asp Asn He Asp His Leu Phe 

125 130 135 

GAC TTG CCT GAT AAC TAC TTC TAT GCG GTG ATG GAC TGT TTC TGT 568 
Asp Leu Pro Asp Asn Tyr Phe Tyr Ala Val Met Asp Cys Phe Cys 

140 145 150 

GAG CCA ACT TGG GGC CAC ACT AAA CAA TAT CAG ATC GGT TAC TGC 613 
Glu Pro Thr Trp Gly His Thr Lys Gin Tyr Gin He Gly Tyr Cys 

155 160 165 

CAG CAG TGC CCC CAT AAG GTT CAG TGG CCC ACT CAC TTT GGG CCC 658 
Gin Gin Cys Pro His Lys Val Gin Trp Pro Thr His Phe Gly Pro 

170 175 180 

AAA CCT CCT CTC TAT TTC AAT GCT GGC ATG TTT GTG TAT GAG CCC 703 
Lys Pro Pro Leu Tyr Phe Asn Ala Gly Met Phe Val Tyr Glu Pro 

185 190 195 

AAT TTG GCT ACT TAC CGT GAC CTC CTT CAA ACA GTC CAA GTC ACC 748 
Asn Leu Ala Thr Tyr Arg Asp Leu Leu Gin Thr Val Gin Val Thr 

200 205 210 

CAG CCC ACT TCC TTT GCT GAA CAG GAT TTT TTG AAC ATG TAC TTC 793 
Gin Pro Thr Ser Phe Ala Glu Gin Asp Phe Leu Asn Met Tyr Phe 

215 220 225 

AAG GAC AAA TAT AGG CCA ATT CCT AAT GTC TAC AAT CTT GTG CTG 838 
Lvs Asp Lys Tyr Arg Pro He Pro Asn Val Tyr Asn Leu Val Leu 

230 235 240 

GCC ATG CTG TGG CGT CAC CCT GAG AAC GTT GAG CTT GAC AAA GTT 883 
Ala Met Leu Trp Arg His Pro Glu Asn Val Glu Leu Asp Lys Val 

245 250 255 

AAA GTG GTT CAC TAC TGT GCT GCT GGG TCT AAG CCT TGG AGG TAC 928 
Lys Val Val His Tyr Cys Ala Ala Gly Ser Lys Pro Trp Arg Tyr 

260 265 270 

ACT GGG AAG GAG GAG AAT ATG GAG AGA GAA GAT ATC AAG ATG TTA 973 
Thr Gly Lys Glu Glu Asn Met Glu Arg Glu Asp He Lys Met Leu 

275 280 285 
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GTG AAA AAG TGG TGG GAT ATA TAT GAG GAT GAG ACT TTG GAC TAC 1018 
Val Lys Lys Trp Trp Asp lie Tyr Glu Asp Glu Thr Leu Asp Tyr 

290 295 300 

AAC AAT CCA CTC AAT GTG GAT AAG TTC ACT GCG GCA CTT ATG GAG 1063 
Asn Asn Pro Leu Asn Val Asp Lys Phe Thr Ala Ala Leu Met Glu 

305 310 315 

GTT GGT GAA GTC AAG TTC GTC CGT GCC CCA TCT GCT GCT TAA 
Val Gly Glu Val Lys Phe Val Arg Ala Pro Ser Ala Ala 

320 325 

GAGTGTCTTT GGAAATCAAG TGTGATCCAA GTACATGTAC AAAGTCATAC 

ATCATTACAT TAACTTTTAT GTATTTCTAA AAGTCATACA TCATTACATT 

AAGTTTTATG TATTTCTAAA GTCTTAAGAC TTAAGAGGAC CTTTTTTATK 

KKKCCCGCTT TTCTTTTTTT CTTTTTCCAA TTCTGTCATT GTAAAGSRGA 

GAATACCGTA TCCTTAATTT TATAAATGGA TATGAATTTT ATTTGTACTA 

AAGGGGGGGC CGGTACCAAT TCGCCTATAG T 



. (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE; protein 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: Peptide 
(v) FRAGMENT TYPE: CNBr cleavage product 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cucurbit a pepo 

( B ) STRAIN : unknown 
(F) TISSUE TYPE: leaf 

(x) PUBLICATION INFORMATION: unpublished 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Asn Gly Tyr Phe Tyr Ala Val lie Asp 
1 5 10 



1105 

1155 
1205 
1255 
1305 
1355 
1386 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION : Peptide 
(v) FRAGMENT TYPE: CNBr cleavage product 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cucurbit a pePP 

(B ) STRAIN : unknown 
(F) TISSUE TYPE: leaf 

(x) PUBLICATION INFORMATION: unpublished 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Tyr Phe Asn Asp He Tyr Lys Pro He Pro 
1 5 10 

Xaa He Tyr Asn 
11 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: protein 

( C ) STRANDEDNES S : s ingle 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: 

A) DESCRIPTION: Peptide 
(v) FRAGMENT TYPE: CNBr cleavage product 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hnnirbita pepp 

(B) STRAIN: unknown 
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(F) TISSUE TYPE: leaf 
(x) PUBLICATION INFORMATION: unpublished 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Tyr Phe Asn Asp lie Tyr Lys Pro lie Pro 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: Peptide 
(v) FRAGMENT TYPE: N-terminus 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cucurbit a pepo 

(B) STRAIN: unknown 
(F) TISSUE TYPE: leaf 

(x) PUBLICATION INFORMATION: unpublished 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Pro Ala Ala Thr Glu Thr Ala lie Glu Xaa 
15 10 

Thr Asp Als Pro Lys Arg Ala Phe Val 
11 15 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTGGATCCCC CGGGCTGC&G GAATTCGGCA CGAGTGTTGT 40 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTGGATCCCC CAATTCGGCA CGAGTGTTGT 30 
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What is claimed is: 

1 . An isolated nucleic acid fragment comprising a 
5 nucleotide sequence encoding plant galactinol synthase. 

2. A nucleic acid fragment according to Claim 1 
wherein the plant is selected from the group consisting 
of soybean, zucchini, canola f cotton, edible legumes, 
sugar beet, coniferous horticultural species, Stachys . 

10 maize and tobacco. 

3. A nucleic acid fragment according to Claim 2 
wherein the plant is selected from the group consisting 
of soybean, zucchini and canola. 

4. A chimeric gene capable of being expressed in 
15 transformed plants comprising the nucleic acid fragment 

of any of Claims 1-3 operably linked to a suitable 
regulatory sequence. 

5. A chimeric gene capable of being expressed in 
transformed plants comprising a nucleic acid fragment 

20 that encodes antisense RNA complementary to plant 
galactinol synthase. 

6. A chimeric gene capable of being expressed in 
transformed microorganisms comprising the nucleic acid 
fragment of any of Claims 1-3 operably linked to a 

25 suitable regulatory sequence. 

7. A host cell transformed with the chimeric gene 
of Claim 4. 

8. A plant transformed with the chimeric gene of 
Claim 4 . 

30 9. A plant according to Claim 8 wherein the plant 

is selected from the group consisting of soybean, 
zucchini, canola, cotton, edible legumes, sugar beet, 
coniferous horticultural species, Stachys , maize and 
tobacco. 
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10. A plant according to Claim 9 wherein the plant 
is selected from the group consisting of soybean, 
zucchini and canola. 

11. Seeds obtained from the plants of Claim 8, 
5 12. Seeds obtained from the plants of Claim 9* 

13. Seeds obtained from the plants of Claim 10. 

14. A microorganism transformed with the chimeric 

gene of Claim 6. 

15. A microorganism according to Claim 14 wherein 

10 the microorganism is £. £QlX- 

16. A method for obtaining plants and plant cells 
containing altered levels of raffinose saccharides 
comprising: 

(a) transforming a plant cell with a chimeric 
15 gene that encodes antisense RNA complementary to 

galactinol synthase; 

(b) growing fertile plants from said 

transformed plant cell; and 

(c) screening progeny plants for the desired 
20 levels of raffinose saccharides. 

17 . A method for obtaining plants and plant cells 
containing altered levels of raffinose saccharides 
comprising: 

(a) transforming a plant cell with a chimeric 

25 gene of Claim 4; 

(b) growing fertile plants from said 

transformed plant cell; and 

(c) screening progeny plants for the desired 
levels of raffinose saccharides. 

30 18. A method for obtaining plants and plant cells 

containing altered levels of sucrose comprising: 

(a) transforming a plant cell with a chimeric 
gene that encodes antisense RNA complementary to 
galactinol synthase; 
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(b) growing fertile plants from said 
transformed plant cell; and 

(c) screening progeny plants for the desired 
levels of sucrose. 

5 19. A method for obtaining plants and plant cells 

containing altered levels of sucrose comprising: 

(a) transforming a plant cell with a chimeric 
gene of Claim 4; 

(b) growing fertile plants from said 
10 transformed plant cell; and 

(c) screening progeny plants for the desired 
levels of sucrose. 

20. A method for obtaining plants and plant cells 
containing altered levels of sucrose and altered levels 

15 of raffinose saccharides comprising: 

(a) transforming a plant cell with a chimeric 
gene that encodes antisense RNA complementary to 
galactinol synthase; 

(b) growing fertile plants from said 
20 transformed plant cell; and 

(c) screening progeny plants for the desired 
levels of sucrose and raffinose saccharides. 

21. A method for obtaining plants and plant cells 
containing altered levels of sucrose and altered levels 

25 of raffinose saccharides comprising: 

(a) transforming a plant cell with a chimeric 
gene of Claim 4; 

(b) growing fertile plants from said 
transformed plant cell; and 

30 (c) screening progeny plants for the desired 

levels of sucrose and raffinose saccharides. 

22. A method for producing galactinol comp.; sing: 
(a) transforming £. with the chimeric 

gene of Claim 6; 
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(b) growing said £. coli under conditions 
suitable for expression of galactinol synthase; 

(c) isolating a galactinol synthase enzyme 
from the £. of step (b) ; and 

5 (d) contacting said isolated galactinol 

synthase enzyme with appropriate substrates in order to 

produce galactinol. 

23- A method for producing galactinal comprising: 

(a) isolating a galactinol synthase enzyme 

10 from a plant; and 

(b) contacting said isolated galactinol 
synthase enzyme with appropriate substrates in order to 
produce galactinol. 

24. The method of Claim 23 wherein the galactinol 
15 synthase enzyme is isolated from zucchini. 

25. A method of RFLP breeding of altered levels of 
raffinose saccharides and sucrose traits in soybeans 
comprising: 

(a) crossing two soybean varieties differing 
20 in the traits for raffinose saccharides and sucrose 

levels; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 
progeny plants resulting from the cross of step (a) ; and 

25 (c) hybridizing the Southern blot with the 

radiolabeled nucleic acid fragment of any of 
Claims 1-3. 

26. A method of varying the level of D-galactose 
containing oligosaccharides of sucrose in plants in 

30 response to end-user requirements, comprising the steps 
of: 

(a) combining the nucleic acid fragment of 
SEQ ID NO: 5 or 6 with suitable regulatory sequences for 
expression and localization in plant tissues; 
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(b) transforming a plant cell with the 
product of step (a) ; 

(c) regenerating plants from said transformed 
plant cell of step (b) to obtain mature plants; 

5 (d) screening the seeds of the plants of step 

(c) for the desired variation in amino acid level. 

27. An isolated nucleic acid fragment comprising a 
nucleotide sequence corresponding to or substantially 
homologous to SEQ ID NO: 5 encoding plant galactinol 

10 synthase. 

28. An isolated nucleic acid fragment comprising a 
nucleotide sequence corresponding to or substantially 
homologous to SEQ ID NO: 6 encoding plant galactinol 
synthase . 

15 
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FIGURE 2/3 
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